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FIBER SHAFT MODIFICATIONS FOR EFFICIENT TARGETING 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial No. 
60/350,388, filed 24 January 2002, entitled "FIBER SHAFT MODIFICATIONS 
5 FOR EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R., and to U.S. provisional application Serial No. 
60/391,967, filed 26 June 2002, entitled "FIBER SHAFT MODIFICATIONS FOR 
EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R. This application is also related to International 

10 PCT application No. (attorney docket number 22908-1236), filed the same day 
herewith, entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT 
TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, Theodore and 
Nemerow, Glen R. Where permitted, the subject matter of each of these 
applications is incorporated by reference herein. 

1 5 FIELD OF INVENTION 

The present invention generally relates to the field of adenoviral vectors 
and the production of such vectors. In particular, detargeted adenoviral vectors 
are provided. 
BACKGROUND 

20 Most, if not all, adenoviral vector-mediated gene therapy strategies aim to 

transduce a specific tissue, such as a tumor or an organ. Systemic delivery will 
require ablation of the normal virus tropism as well as addition of new 
specificities. Multiple interactions between adenoviral particles and the host cell 
are required to promote efficient cell entry (Nemerow (2000) Virology 274:1-4). 

25 An adenovirus entry pathway is believed to involve two separate cell surface 
events. First, a high affinity interaction between the adenoviral fiber knob and 
coxsackie-adenovirus receptor (CAR) mediates the attachment of the adenovirus 
particle to the cell surface. A subsequent association of penton with the cell 
surface integrins a v j3 3 and a y /3 s , which act as co-receptors, potentiates virus 

30 internalization. There are a plurality of adenoviral fiber receptors, which interact 
with the group B (e.g., Ad3) and group C (e.g., Ad5) adenoviruses. Both of 
these groups of adenoviruses appear to require interaction with integrins for 
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internalization. CAR ablation, however, does not change biodistribution and 
toxicity of adenoviral vectors in vivo {Alemany et aL (2001) Gene Therapy 
3:1347-1353; U.S. patent application No. 09/870,203, filed May 30, 2001, and 
published as U.S. Published application No. 20020137213). Thus, the role of 
5 CAR interaction for in vivo gene transfer is not clear. Recently published studies 
have described conflicting results (Alemany et aL (2001) Gene Therapy 
5:1347-1353; Leissner et aL (2001) Gene Therapy 5:49-57; Einfeld et aL (2001) 
J. Viroiogy 75'A 1284-1 1291). For example, it has been shown that vectors 
containing an S408E mutation in the Ad5 fiber AB loop yield efficient liver 
10 transduction in mice, despite having greatly reduced transduction efficiencies on 
cells in culture (see, Leissner et aL (2001) Gene Therapy 5:49-57). In contrast, 
vectors containing a more extensive fiber AB loop mutation showed a 10-fold 
reduction in liver gene expression (see, Einfeld et aL (2001) J, Virology 
75:1 1284-1 1291). 

15 A doubly ablated adenovirus has been prepared by modifying the CAR 

binding region in the fiber loop and the integrin binding region in the penton base 
(Einfeld et aL (2001 ) J. Virology 75:11284-11291): This doubly ablated 
adenovirus, lacking CAR and integrin interactions, was reported not only to lack 
in vitro transduction of various cell types but also to lack in vivo transduction of 

20 liver cells. Specifically, the doubly ablated adenovirus was reported to have a 
700 fold reduction in liver transduction when compared to the non-ablated 
adenovirus. These results, however, were not reproduced by others. 

For many applications, the most clinically useful adenoviral vector would 
be deliverable systemically, such as into a peripheral vein, and would be targeted 

25 to a desired location in the body, and would not have undesirable side effects 
resulting from targeting to other locations. In vivo adenoviral vector targeting is 
a major goal in gene therapy and a significant effort has been focused on 
developing strategies to achieve this goal. Successful targeting strategies would 
direct the entire vector dose to the appropriate site and would be likely to 

30 improve the safety profile of the vector by permitting the use of lower, less toxic 
vector doses, which potentially also can be less immunogenic. Thus, there is a 
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need to develop adenoviruses which are fully detargeted in vivo for use as a 
base vector for producing redirected adenoviruses. 

Therefore, among the objects herein, it is an object herein to provide fully 
detargeted adenoviral vectors, methods for preparation thereof, and uses 
5 thereof. 
SUMMARY 

Detargeted and fully detargeted adenoviral particles, adenovirus vectors 
from which such particles are produced, methods for preparation of the vectors 
and particles and uses of the vectors and particles are provided. Provided and 

10 described are capsid modifications, such as fiber shaft modifications, and the 
resuling proteins that, when expressed on adenoviral particles provide for 
Retargeting of adenoviral vectors. The capsid modifications, such as the fiber 
shaft modifications, can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

15 adenoviral particles. Thus, adenoviral vectors and adenovirual particles whose 
native tropisms are ablated through a modification or modifications of capsid 
proteins, particularly a fiber shaft region, are provided. 

Thus, provided are capsid mutiations, including fiber shaft modifications, 
that ablate binding to particular receptors, thereby permitting efficient targeting 

20 of adenoviral vectors that contain capsids with such modifications. For example, 
adenoviral vectors in which the fiber shaft's interaction with HSP is ablated 
(reduced or substantially eliminated), particuarly in vivo, are provided. These 
fiber shaft modifications can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

25 adenoviral vectors. Also provided are retargeted vectors and particles that 

include a ligand or ligands to provide for targeting of the detargeted vectors and 
particles to selected cells and/or tissues. Retargeting can be effected, for 
example, by manipulating the fiber protein to redirect the receptor specificity to a 
particular cell type. 

30 Also provided are nucleic acids encoding the modified fiber proteins and 

also modified penton proteins. Also provided are nucleic acids encoding the 
modified fiber shaft protein that has ablated HSP binding and combinations 
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thereof with other modified fiber regions or other proteins, such as a modified 
fiber knob region and/or the modified penton protein. The nucleic acids also can 
contain heterologous nucleic acid sequences, such as promoters or nucleic acid 
sequences encoding polypeptides. The viral particles that express fibers 
5 containing such shaft modifications and other modifications are also provided. 

Also provided are methods for making and using the adenoviral particles 
that express the modified fibers and combinations of modified fibers and 
modified penton. With the fiber shaft modifications, particularly in combination 
with the fiber knob modifications and the penton modifications, the adenovirus 

10 particles are ablated for binding to their natural cellular receptor(s), i.e., they are 
detargeted. They can then be "retargeted" to a specific cell type through the 
addition of a ligand to the virus capsid, which causes the virus to bind to and 
infect such cell. The ligand can be added, for example, through genetic 
modification of a capsid protein gene. 

15 Also provided is a method for reducing liver toxicity in adeno- 

viral-mediated therapy. In contrast to the results of Einfeld eta/. (Einfeld et af. 
(2001) J. Virology 75'A 1284-1 1291), it is shown herein that a doubly ablated 
adenovirus, lacking CAR and integrin interactions, is capable of in vivo liver 
transduction. It is shown herein that ablation of liver transduction requires 

20 further and/or alternative modification(s). The method for reducing liver toxicity 
in adenoviral-mediated therapy includes 

modifying an adenoviral vector to ablate native tropism to liver cells in vivo. 
Such vector can be administered to a subject. The modifications include the 
modifications described herein. 

25 The nucleic acids, proteins, adenoviral particles and adenoviral vectors 

have a variety of uses. These include in vivo and in vitro uses to target nucleic 
acid to particular cells and tissues, for therapeutic purposes, including gene 
therapy, and also for the identification and study of cell surface receptors and 
identification of modes of interaction of viruses with cells. 

30 In particular, adenoviral fiber shaft modifications that ablate viral 

interaction with HSP (Heparin Sulfate Proteoglycans; also referred to as heparin 
sulfate glycosaminoglycans) are provided. These modifications include 
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mutations of individual amino acids in the fiber shaft that interact with HSP or 
mutations of amino acids in the fiber shaft that modify the ability of the HSP 
binding motif to interact with HSP. Adenoviral fiber shaft modifications also 
include replacements of fiber shafts using fiber shafts of adenoviruses, such as, 
5 for example, Ad3, Ad35 and Ad41 short fiber shaft, that do not contain HSP 
binding sites. 

Also provided are adenoviral fiber shaft modifications that alter, 
particularly ablate viral interaction with HSP, as described above, in combination 
with fiber knob modifications that ablate viral interaction with CAR. The fiber 

10 knob modifications include: (a) mutations of individual amino acids in the fiber 
loop that interact with CAR, such as, for example, AB or CD loop modifications; 
(b) mutations of individual amino acids in the fiber loop that modify the ability of 
the CAR binding motif to interact with CAR; and (c) replacements of fiber knobs 
using adenoviruses that do not interact with CAR, such as, for example, Ad3 

15 fiber knob, Ad41 short fiber knob, or Ad35 fiber knob. 

Also provided are adenoviral fiber shaft pnodifications as described above 
in combination with penton modifications that ablate viral interaction with a v 
integrins. The penton modifications include: (a) mutations of individual amino 
acids that interact with cr v integrins; (b) mutations of individual amino acids that 

20 modify the ability of the a v integrin binding motif to interact with the a v integrins; 
and (c) replacement of penton proteins using penton proteins from adenoviruses 
that do not interact with the a y integrins. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with fiber knob modifications as described above and penton 

25 modifications as described above. 

Also provided is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium results in entry of adenoviral 
particles into the producer cells. 
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Provided aer recombinant viral particles that contain a modified capid 
protein whereby binding to heparin sulfate proteoglycans (HSP) is reduced or 
eliminated compared to particles that contain unmodified capsid proteins. The 
modified capsid proteins include fibers proteins with modified shafts such that 
5 binding to HSP is reduced or eliminated. 

Among the particular embodiments the following are provided. Provided 
are adenovirus capsid proteins that are modified to alter, typically reduce or 
eliminate, binding to or interaction with in vivo and/or in vitro to heparin sulfate 
proteoglycan (HSP). HSPs are expressed on various cells, including 
10 hepatocytes. It is shown herein that HSPs provide for or participate in 

transduction of cells, such as liver cells. Since it can be desirable to eliminate or 
reduce such transduction, the modifications of the capsid proteins, such as fiber 
proteins, permit detargeting of particles that express such proteins from such 
cells. 

15 Thus provided are modified adenovirus fiber proteins that include a 

mutation, such as an insertion, deletion, change, replacement of amino acids or 
combinations thereof, whereby binding to or interaction with heparin sulfate 
proteoglycan (HSP) is altered. In particular, the he binding of the modified fiber 
protein is eliminated or reduced compared to the unmodified protein. Exemplary 

20 of these mutations are mutations in the shaft of a fiber, where the shaft also can 
include the tail. The mutations can reduce or alter the affinity of the fiber 
protein for HSP is reduced at least by 2-fold, 5-fold, 10-fold, 100-fold or more, 
including substantially eliminating it. 

As provided herein, fibers from adenoviruses that interact with HSP can 

25 include a motif, such as BBXB or BBBXXB, wherethe B is a basic amino acid and 
X is any amino acid, particularly the consensus sequence KKTK in Ad5 and Ad2. 
Thus, provided are fibers in which the motif is altered to eliminate or reduce 
interaction with HSP. 

Also provided are modified fiber protein of claim 1 that are chimeras in 

30 which the fiber shaft (or fiber shaft and tail) are derived from a fiber, such as 
Ad3, Ad35, Ad7, Ad1 1 , Ad1 6, Ad21 , Ad34, Ad40, Ad41 or Ad46 fiber, that 
does not interact with HSP and combined with fiber that does interact, such as 



WO 2003/062400 



PCT/US2003/002295 



Ad5 or Ad2 fiber, to produce a complete fiber whose binding to HSP is reduced 
or eliminated. 

All of the modified capsids proteins provided herein also can include one 
or more further modifications that reduce or eliminate interaction of the resulting 
5 fiber with one or more cell surface proteins, such as but not limited to, CAR and 
ar v integrin or other receptor to which a particular native fiber binds, in addition to 
HSP. These modifications include, but are not limited to, modification to fiber 
that reduces or eliminates CAR binding and modification to penton that reduces 
or eliminate a v integrin binding. The mutations can be in the fiber knob, shaft, 

10 tail and shaft, and also in penton. 

Any and all of the modified capsid proteins provided herein can further 
include a ligand that binds to a particular receptor thereby endowing a fiber (or 
other capsid protein) with binding specificity or the ability to interact with such 
receptor. The ligand can be inserted into any suitable site in a capsid protein, 

15 such as an insertion or replacement. For example, fibers with ligands inserted 
into the knob region are exemplified. Any such ligand can be employed and a 
variety are exemplified herein. 

A variety of modified capsid proteins are exemplified herein. These 
include, but are not limited to, fibers containing: the sequence of amino acids 

20 set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a 
sequence of amino acids having 60%, 70%, 80%, 90%, 95% or greater 
sequence identity with a sequence of amino acids set forth in any of SEQ ID 
Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a sequence of amino acids encoded 
by a sequence of nucleotides that hybridizes under conditions of high stringency 

25 along at least 70% of its length to a sequence of nucleotides that encodes a 

sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
70 and 72. 

Nucleic acids encoding the capsid proteins, including the fibers are also 
provided. The nucleic acids can be provided as vectors, particularly as 
30 adenovirus vectors. Many adenoviral vectors are known and can be modified as 
needed in accord with the description herein. Adenoviral vectors include, but 
are not limited to, early generation adenoviral vectors, such as E1 -deleted 



WO 2003/062400 



PCT/US2003/002295 



vectors, gutless adenoviral vectors and replication-conditional adenoviral vectors, 
such as oncolytic adenoviral vectors. The adenovirus vectors also can include 
heterologous nucleic acids that encode or provide products, such as therapeutic 
products. Any therapeutic product is contemplated and a variety are set forth 
5 herein as exemplary. Heterologous nucleic acid can encode a polypeptide or 
comprise or encode a regulatory sequence, such as a promoter or an RNA, 
including RNAi, small RNAs, other double-stranded RNAs, antisense RNA, and 
ribozymes. Promoters include, for example, constitutive and regulated 
promoters and tissue specific promoter, including tumor specific promoters. 

10 The promoter can be operably linked, for example, to a gene of an adenovirus 
essential for replication. 

Cells containing the nucleic acid molecules and cells containing the 
vectors are also provided. Such cell include packaging cells. The cells can be 
prokaryotic or eukaryotic cells, including, mammalian cells, such a primate cells, 

15 including human cells. 

Also provided are adenoviral particles that contain the modified capsid 
proteins provided herein. The particles have altered interaction or binding with 
HSP compared to particles that do not contain the modified capsid proteins. In 
addition to altered binding to HSP, which is typically reduced or eliminated 

20 binding, the particles can include further modifications, such as capsid proteins 
with altered interaction with other receptors as described above. In particular, 
the particles can have altered, typically reduced or eliminated, interaction with 
CAR, a v integrin and/or other receptors. The mutation include mutations in the 
fiber knob, penton and hexon. Exemplary fiber know mutations are mutations in 

25 the AB loop or CD loop, such as KOI or K012, which are described herein. In 
addition, the particles can include additional ligands for retargeting to selected 
receptors. The adenoviral particles can be from any serotype and subgroup. 

Methods for expressing heterologous nucleic acids in a cell are provided. 
In these methods an adenoviral vector provided herein is transduced into a cell 

30 to deliver the nucleic acid and/or encoded products. Transduction can be 
effected in vivo or in vitro or ex vivo r and can be for a variety of purposes 
including study of gene expression and genetic therapy. The cells can be 
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prokaryotic cells, but typically are eukaryotic cells, including mammalian cells, 
such as primate, including human, cells. The cells can be of a specific type, such 
as a tumor cell or a cell in a particular tissue. The vectors can be oncolytic 
vector to effect killing of tumor cells. 
5 Since the modified capsid proteins herein have reduced or eliminated 

binding to HSP, viral particles containing such proteins exhibit ablated binding to 
HSP in vitro and in vivo. Thus provided is a method of reducing transduction of 
cells that express HSP, such as hepatocytes in the liver, by modifying a capsid 
protein, such as fiber to eliminate or reduce interaction with or binding to HSP. 

10 Such reduction reduces or eliminates transduction of cells that express HSP, 
including liver cells. 

Also provided are scale-up methods for the propagation of detargeted 
adenoviral particle, such as those provided herein. The method includes the 
steps of infecting or transducing a cell capable of replicating, maturing and 

15 packaging an adenoviral vector with a detargeted adenoviral vector in the 

presence of a reagent that results in entry of the adenoviral particle into the cell, 
such as a polycation and/or a bifunctional protein or other such reagent; and 
culturing the infected cell under conditions suitable for growth, spread and 
propagation of the adenoviral vector. The resulting adenoviral particles can be 

20 recovered. Polycations include, but are not limited to, hexadimethrine bromide, 
polyethylenimine, protamine sulfate and poly-L-lysine. Bifunctional proteins, 
include, but are not limited to, an anti-fiber antibody ligand fusion, an 
anti-fiber-Fab-FGF conjugate, an anti-penton-antibody ligand fusion, an 
anti-hexon antibody ligand fusion and a polylysine-peptide fusion. The ligand is 

25 selected to bind to a particular receptor. 

The viral particles that express a modified capsids provided herein can be 
produced by this method. The modification include, for example, one or more 
mutations selected from among mutations that reduce or eliminate interactions 
with one or more of ct v integrins, coxsackie-adenovirus receptors (CAR) and 

30 heparin sulfate proteoglycans (HSP). Such mutations include, for example, PD1, 
K01, KQ12 and S*. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a plasmid map for pSKOl . 

Figure 2 is a plasmid map for pNDSQS.IKOI. 

Figures 3A-3C are plasmid maps of pAdmireRSVnBg.(Fig. 3A), pSQI Fig. 
5 3B) and pSQ1K012 (Fig. 3C) 

Figure 4 is a plasmid map for pSQ1 PD1 . 

Figures 5A-5B are plasmid maps of pSQ1FK01PD1 {Fig. 5A) and 
pSQ1K012PD1 (Fig. 5B). 

Figure 6 shows in vitro transduction efficiency of A549 cells using 
10 adenoviral vectors containing fiber AB loop knob and/or penton, PD1 mutations. 
The following adenoviral vectors were used in these studies: AvInBg, 
Av1 nBgFKOI , referred to as FKOI, Av1nBgPD1, referred to as PD1, and 
Av1 nBgFKOI PD1 that is referred to as FK01PD1. 

Figure 7A-7B shows in vivo adenoviral-mediated liver gene expression 
15 (Fig. 7A) and hexon DNA content (Fig. 7B) using adenoviral vectors containing 
fiber AB loop knob and/or penton, PD1 mutations. The following adenoviral 
vectors were used in these studies: AvInBg, Av1 nBgFKOI, referred to as 
FKOI, Av1 nBgPDI , referred to as PD1, Av1 nBgFKOI PD1 , referred to as 
FK01PD1, Av1nBgK012, referred to as K012, and Av1 nBgKOI 2PD1 that is 
20 referred to as KOI 2PD1 . 

Figure 8 is a plasmid map for pFBshuttle(EcoRl). 

Figure 9 is a plasmid map for pSQ1 HSP. 

Figure 1 0 is a plasmid map for pSQ1 HSPKOI . 

Figure 1 1 is a plasmid map for pSQ1 HSPPD1 . 
25 Figure 1 2 is a plasmid map for pSQ1 HSPKOI PD1 . 

Figures 13A-13C show the transduction efficiency of A549 and HeLa 
cells using adenoviral vectors containing fiber shaft, knob and/or penton 
mutations. Fig. 13A shows the dose response for the transduction efficiency of 
A549 cells. Fig. 13B shows the transduction efficiency of HeLa cells at 2000 
30 ppc. Figure 13C shows the competition analysis of adenoviral vectors 
containing fiber shaft mutations. 
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Figures 14A-14B shows the influence of fiber shaft mutations on in vivo 
adenoviral-mediated liver gene expression (Fig. 14A) and hexon DNA content 
(Fig. 14B). 

Figures 15A-15B are plasmid maps of pSQIHSPRGD (Fig. 1 5A) and 
5 pSQIHSPKOIRGD (Fig. 15B). 

Figure 16 shows that insertion of a RGD targeting ligand can restore 
transduction of the vectors containing the HSP binding shaft S* mutation. 

Figures 17A-17B are plasmid maps of pSQ1 AD35Fiber (Fig. 1 7A) and 
pSQ1 Ad35FcRGD (Fig. 17B). 
10 Figures 18A-18B are maps of plasmids encoding 35F chimeric fibers. Fig. 

18A is a plasmid map of pSQ135T5H, and Fig. 1 8B is a plasmid map of 
pSQ15T35H. 

Figure 19 shows the results of an in vitro analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 
15 Figure 20 shows the results of an in vivo analysis of Ad5 vectors 

containing Ad35 fibers and derivatives thereof. 

Figures 21A-21B are plasmid maps of pSQ1Ad41sF (Fig. 21 A) and 
pSQ1 Ad41 sFRGD (Fig. 21B). 

Figure 22 shows the results of an in vivo analysis of Ad5 vectors 
20 containing Ad41 short fiber. 

Figure 23 shows the in vitro analysis of Ad5 based vectors containing the 
Ad41 short fiber which has been re-engineered to contain a cRGD ligand in the 
HI loop. 

Figure 24 shows enhanced transduction of AE1 -2a cells with the 
25 Av3nBgFK01 detargeted adenoviral vector using hexadimethrine bromide (HB), 
protamine sulfate (PS) and poly-lysine-RGD (K14) or the anti-penton-TNFa 
bifunctional protein (apen-TNF). 

Figure 25 shows ablation of HSP interaction decreases 
adenoviral-mediated gene transfer to other organs 
30 Figure 26 shows in vivo liver transduction with adenoviral vectors which 

encode for B-galactosidase and contain various mutations to the fiber and/or 
penton proteins. Results are plotted as percent transduction as compared to 
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wild type. Two different methods for determining the level of transduction are 
shown for each vector. 

Figure 27 shows the adenoviral vector biodistribution to the liver and 
tumor for the vectors containing the S* # KOIS*, and 41 sF fibers. 
5 DETAILED DESCRIPTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
which the invention(s) belong. All patents, patent applications, published 

10 applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 
otherwise, are incorporated by reference in their entirety. In the event that there 
are a plurality of definitions for terms herein, those in this section prevail. 
Where reference is made to a URL or other such identifier or address, it 

15 understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information is known and can be 
readily accessed, such as by searching the internet and/or appropriate 
databases. Reference thereto evidences the availability and public dissemination 
of such information. 

20 As used herein, the term "adenovirus" or "adenoviral particle" is used to 

include any and all viruses that can be categorized as an adenovirus, including 
any adenovirus that infects a human or an animal, including all groups, 
subgroups, and serotypes. Depending upon the context reference to 
"adenovirus" can include adenoviral vectors. There are at least 51 serotypes of 

25 Adenovirus that classified into several subgroups. For example, subgroup A 

includes adenovirus serotypes 12, 18, and 31. Subgroup C includes adenovirus 
serotypes 1, 2, 5, and 6. Subgroup D includes adenovirus serotype 8, 9, 10, 13, 
15, 17, 19, 20, 22-30, 32, 33, 36-39, and 42-49. Subgroup E includes 
adenovirus serotype 4. Subgroup F includes adenovirus serotypes 40 and 41 . 

30 These latter two serotypes have a long and a short fiber protein. Thus, as 
used herein an adenovirus or adenovirus particle is a packaged vector or 
genome. 
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As used herein, "virus," "viral particle," "vector particle," "viral vector 
particle," and "virion" are used interchangeably to refer to infectious viral 
particles that are formed when, such as when a vector containing all or a part of 
a viral genome, is transduced into an appropriate cell or cell line for the 
5 generation of such particles. The resulting viral particles have a variety of uses, 
including, but not limited to, transferring nucleic acids into cells either in vitro or 
in vivo. For purposes herein, the viruses are adenoviruses, including recombinant 
adenoviruses formed when an adenovirus vector, such as any provided herein, is 
encapsulated in an adenovirus capsid. Thus, a viral particle is a packaged viral 

10 genome. An adenovirus viral particle is the minimal structural or functional unit 
of a virus. A virus can refer to a single particle, a stock of particles or a viral 
genome. The adenovirus (Ad) particle is relatively complex and may be resolved 
into various substructures. 

Included among adenoviruses and adenoviral particles are any and all 

15 viruses that can be categorized as an adenovirus, including any adenovirus that 
infects a human or an animal, including all groups, subgroups, and serotypes. 
Thus, as used herein, "adenovirus" and "adenovirus particle" refer to the virus 
itself and derivatives thereof and cover all serotypes and subtypes and naturally 
occurring and recombinant forms, except where indicated otherwise. Included 

20 are adenoviruses that infect human cells. Adenoviruses can be wildtype or can 
be modified in various ways known in the art or as disclosed herein. Such 
modifications include, but are not limited to, modifications to the adenovirus 
genome that is packaged in the particle in order to make an infectious virus. 
Exemplary modifications include deletions known in the art, such as deletions in 

25 one or more of the E1a, E1b, E2a, E2b, E3, or E4 coding regions. Other 

exemplary modifications include deletions of all of the coding regions of the 
adenoviral genome. Such adenoviruses are known as "gutless" adenoviruses. 
The terms also include replication-conditional adenoviruses, which are viruses 
that preferentially replicate in certain types of cells or tissues but to a lesser 

30 degree or not at all in other types. For example, among the adenoviral particles 
provided herein, are adenoviral particles that replicate in abnormally proliferating 
tissue, such as solid tumors and other neoplasms. These include the viruses 
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disclosed in U.S. Patent No. 5,998,205 and U.S. Patent No. 5,801,029. Such 
viruses are sometimes referred to as "cytolytic" or "cytopathic" viruses (or 
vectors), and, if they have such an effect on neoplastic cells, are referred to as 
"oncolytic" viruses (or vectors). 
5 As used herein, the terms "vector," "polynucleotide vector," 

"polynucleotide vector construct," "nucleic acid vector construct," and "vector 
construct" are used interchangeably herein to mean any nucleic acid construct 
that can be used for gene transfer, as understood by those skilled in the art. 
As used herein, the term "viral vector" is used according to its 

10 art-recognized meaning. It refers to a nucleic acid vector construct that includes 
at least one element of viral origin and can be packaged into a viral vector 
particle. The viral vector particles can be used for the purpose of transferring 
DNA, RNA or other nucleic acids into cells either in vitro or in vivo. Viral vectors 
include, but are not limited to, retroviral vectors, vaccinia vectors, lentiviral 

15 vectors, herpes virus vectors (e.g., HSV), baculoviral vectors, cytomegalovirus 
(CMV) vectors, papillomavirus vectors, simian virus (SV40) vectors, Sindbis 
vectors, semliki forest virus vectors, phage vectors, adenoviral vectors, and 
adeno-associated viral (AAV) vectors. Suitable viral vectors are described, for 
example, in U.S. Patent Nos. 6,057,155, 5,543,328 and 5,756,086. The 

20 vectors provided herein are adenoviral vectors. 

As used herein, "adenovirus vector" and "adenoviral vector" are used 
interchangeably and are well understood in the art to mean a polynucleotide 
containing all or a portion of an adenovirus genome. An adenoviral vector, refers 
to nucleic encoding a complete genome or a modified genome or one that can be 

25 used to introduce heterologous nucleic acid when transferred into a cell, 

particularly when packaged as a particle. An adenoviral vector can be in any of 
several forms, including, but not limited to, naked DNA, DNA encapsulated in an 
adenovirus capsid, DNA packaged in another viral or viral-like form (such as 
herpes simplex, and AAV), DNA encapsulated in liposomes, DNA complexed 

30 with polylysine, complexed with synthetic polycationic molecules, conjugated 
with transferrin, complexed with compounds such as PEG to immunologically 
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"mask" the molecule and/or increase half-life, or conjugated to a non-viral 
protein. 

As used herein, oncolytic adenoviruses refer to adenoviruses that 
replicate selectively in tumor cells 
5 As used herein, a variety of vectors with different requirements and 

purposes are described. For example, one vector is used to deliver particular 
nucleic acid molecules into a packaging cell line for stable integration into a 
chromosome. These types of vectors also are referred to as complementing 
plasmids. A further type of vector carries or delivers nucleic acid molecules in or 

10 into a cell line (e.g., a packaging cell line) for the purpose of propagating viral 

vectors; hence, these vectors also can be referred to herein as delivery plasmids. 
A third "type" of vector is the vector that is in the form of a virus particle 
encapsulating a viral nucleic acid and that is comprised of the capsid modified as 
provided herein. Such vectors also can contain heterologous nucleic acid 

15 molecules encoding particular polypeptides, such as therapeutic polypeptides or 
regulatory proteins or regulatory sequences to specific cells or cell types in a 
subject in need of treatment. 

As used herein, the term "motif" is used to refer to any set of amino 
acids forming part of a primary sequence of a protein, either contiguous or 

20 capable of being aligned to certain positions that are invariant or conserved, that 
is associated with a particular function. The motif can occur, not only by virture 
of the primary sequence, but also as a consequence of three-dimensional folding. 
For example, the motif GXGXXG is associated with nucleotide-binding sites. In 
this fiber is a trimer, hence the trimeric structure can contribute fornation of a 

25 motif. Alternatively, a motif can be considered as a domain of a protein, where 
domain is a region of a protein molecule delimited on the basis of function 
without knowledge of and relation to the molecular substructure, as, e.g., the 
part of a protein molecule that binds to a receptor. As shown herein, the motif 
KKTK constitutes a consensus sequence for fiber shaft interaction with HSP. 

30 As used herein, the term "bind" or "binding" is used to refer to the 

binding between a ligand and its receptor, such as the binding of an Ad5 shaft 
motif with HSP (Heparin Sulfate Proteoglycans), with a K d in the range of 10-2 
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to 10-15 mole/I, generally, 10" 6 to 10" 15 , 10" 7 to 10" 15 and typically 10" 8 to 10" 15 
(and/or a K a of 10 5 -10 12 , 10M0 12 , 10 8 -10 12 l/mole). 

As used herein, specific binding or selective binding means that a the 
binding of a particular ligand and one receptor interaction (k a or K eq ) is at least 2- 
5 fold, generally, 5, 10, 50, 100 or more-fold, greater than for another receptor. 
A statement that a particular viral vector is targeted to a cell or tissue means 
that its affinity for such cell or tissue in a host or in vitro is at least about 2-fold, 
generally, 5, 10, 50, 100 or more-fold, greater than for other cells and tissues in 
the host or under the in vitro conditions. 

10 As used herein, the term "ablate" or "ablated" is used to refer to an 

adenovirus, adenoviral vector or adenoviral particle, in which the ability to bind 
to a particular cellular receptor is reduced or eliminated, generally substantially 
eliminated (i.e., reduced more than 10-fold, 100-fold or more) when compared to 
a coresponding wild-type adenovirus. An ablated adenovirus, adenoviral vector 

15 or adenoviral particle also is said to be detargeted, i.e., the modified adenovirus, 
adenoviral vector or adenoviral particle does not possess the native tropism of 
the wild-type adenovirus. The reduction or elimination of the ability of the 
mutated adenovirus fiber protein and/or mutated adenovirus penton protein to 
bind a cellular receptor as compared to the corresponding wild-type fiber protein 

20 and/or wild-type penton protein can be measured or assessed by comparing the 
transduction efficiency (gene transfer and expression of a marker gene) of an 
adenovirus particle containing the mutated fiber protein and/or mutated penton 
protein compared to an adenovirus particle containing the wild-type fiber protein 
and/or wild-type penton protein for cells having the cellular receptor. 

25 As used herein, tropism with reference to an adenovirus refers to the 

selective infectivity or binding that is conferred on the particle by a capsid 
protein, such as the fiber protein and/or penton. 

As used herein, "penton" or "penton complex" is used herein to designate 
a complex of penton base and fiber. The term "penton" can also be used to 

30 indicate penton base, as well as penton complex. The meaning of the term 
"penton" alone should be clear from the context within which it is used. 
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As used herein, the term "substantially eliminated" refers to a 
transduction efficiency less than about 1 1 % of the efficiency of the wild-type 
fiber containing virus on HeLa cells. The transduction efficiency on Hela cells 
can be measured (see, e.g., Example 1 of U.S. Patent Application Serial No. 
5 09/870,203 filed on 30 May 2001, and published as U.S. Published application 
No. 20020137213, and of International Patent Application No. PCT/EP01 /06286 
filed 1 June 2001). Briefly, HeLa cells are infected with the adenoviral vectors 
containing mutated fiber proteins to evaluate the effects of fiber amino acid 
mutations on CAR interaction and subsequent gene expression. Monolayers of 

10 HeLa cells in 12 well dishes are infected with, for example, 1000 particles per 
cell for 2 hours at 37° C. in a total volume of, for example, 0.35 ml of the 
DMEM containing 2% FBS. The infection medium is then aspirated from the 
monolayers and I ml of complete DMEM containing 10% FBS was added per 
well. The cells are incubated for an sufficient time, generally about 24 hours, to 

15 allow for /?-galactosidase expression, which is measured by a chemiluminescence 
reporter assay and by histochemical staining with a chromogenic substrate. The 
relative levels of y?-galactosidase activity are determined using as suitable 
system, such as the Galacto-Light chemiluminescence reporter assay system 
(Tropix, Bedford, Mass.) Cell monolayers are washed with PBS and processed 

20 according to the manufacturer's protocol. The cell homogenate is transferred to 
a microfuge tube and centrifuged to remove cellular debris. Total protein 
concentration is determined, such as by using the bicinchoninic acid(BCA) 
protein assay (Pierce, Inc., Rockford, III.) with bovine serum albumin as the 
assay standard. An aliquot of each sample is then incubated with the Tropix 

25 £-galactosidase substrate for 45 minutes in a 96 well plate. A luminometer is 
used determine the relative light units (RLU) emitted per sample and then 
normalized for the amount of total protein in each sample (RLU/ug total protein). 
For the histochemical staining procedure, the cell monolayers are fixed with 
0.5% glutaraldehyde in PBS, and then were incubated with a mixture of 1 mg of 

30 5-bromo-4-chloro-3-indolyl-)&-D-galactoside (X-gal) per ml, 5 mM potassium 

ferrocyanide, 5 mM potassium ferricyanide and 2 mM MgCI 2 in 0.5 ml of PBS. 
The monolayers are washed with PBS and the blue cells are visualized by light 
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microscopy, such as with a Zeiss ID03 microscope. Generally, the efficiency is 
less than about 9%, and typically is less than about 8%. 

As used herein, the phrase "reduce" or "reduction" refers to a change in 
the efficiency of transduction by the adenovirus containing the mutated fiber as 
5 compared to the adenovirus containing the wild-type fiber to a level of about 
75% or less of the wild-type on HeLa cells. Generally, the change in efficiency 
is to a level of about 65% or less than wild-type. Typically it is about 55% or 
less. This system is able to rapidly analyze modified fiber proteins and/or 
modified penton proteins for desired tropism in the context of the viral particle. , 

10 As used herein, the term "mutate" or "mutation" or similar terms refers to 

the deletion, insertion or change of at least one amino acid in the part of the 
fiber shaft region interacting with HSP. The amino acid can be changed by 
substitution or by modification in a way that derivatizes the amino acid. Thus, 
for example a BBXB motif or BBBXXB motif, where B is a basic amino acid, in an 

15 adenovirus is mutated to ablate the viral interaction with HSP. 

As used herein, the term "polynucleotide" means a nucleic acid molecule, 
such as DNA or RNA, that encodes a polynucleotide. The molecule can include 
regulatory sequences, and is generally DNA. Such polynucleotides are prepared 
or obtained by techniques known by those skilled in the art in combination with 

20 the teachings contained therein. 

As used herein, adenoviral genome is intended to include any adenoviral 
vector or any nucleic acid sequence comprising a modified fiber protein. All 
adenovirus serotypes are contemplated for use in the vectors and methods 
herein. 

25 As used herein, the term "viral vector" is used according to its art- 

recognized meaning. It refers to a nucleic acid vector construct that includes at 
least one element of viral origin and can be packaged into a viral vector particle. 
The viral vector particles can be used, for example, for transferring DNA into 
cells either in vitro or in vivo. 

30 As used herein, a packaging cell line is a cell line that is able to package 

adenoviral genomes or modified genomes to produce viral particles. It can 
provide a missing gene product or its equivalent. Thus, packaging cells can 
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provide complementing functions for the genes deleted in an adenoviral genome 
(e.g., the nucleic acids encoding modified fiber proteins) and are able to package 
the adenoviral genomes into the adenovirus particle. The production of such 
particles require that the genome be replicated and that those proteins necessary 
5 for assembling an infectious virus are produced. The particles also can require 
certain proteins necessary for the maturation of the viral particle. Such proteins 
can be provided by the vector or by the packaging cell. 

As used herein, detargeted adenoviral particles have ablated (reduced or 
eliminated) interaction with receptors with which native particles. Fully 

10 detargeted particles have two or more specificities altered. It is understood that 
in vivo no particles are fully ablated such that they do not interact with any cells. 
Degareted and fully degarated have reduced, typically substantial! reduced, or 
eliminated interaction with native receptors. For purposes herein, detargeted 
particles have reduced (2-fold, 5-fold, 10-fold, 100-fold or more) binding or 

15 virtually no binding to HSP receptors; fully degareted vectors include further 

capsid modifications to eliminate interactions with other receptors, such as CAR 
and integrins or other receptors. The particles still bind to cells, but the types of 
cells and interactions are reduced. 

As used herein, pseudotyping describes the production of adenoviral 

20 vectors having modified capsid protein or capsid proteins from a different 

serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 or Ad35 fiber protein. 
This can be accomplished by producing the adenoviral vector in packaging cell 
lines expressing different fiber proteins. As provided herein, detargeting of an 

25 adenovirus 5 particle or other serotype group C adenovirus or other adenovirus 
that binds to HSP to reduce or eliminate binding to HSPs can be effected by 
replacing all or a portion that includes the shaft or at least the HSP consensus 
binding sequence of the Ad5 fiber with an adenovirus fiber or portion thereof 
that does not bind to HSP. Adenoviruses having fiber shafts that do not interact 

30 with HSP include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Ad11, 
Ad16, Ad21, Ad34 which do not have interaction with HSP, (b) adenoviruses of 
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subgroup F, e.g., Ad40 and Ad41, specifically the short fiber, and (c) 
adenoviruses of subgroup D, e.g., Ad46. 

As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
5 protein" can be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

As used herein, the term "cyclic RGD" (or cRGD) refers to any amino acid 
that binds to a v integrins on the surface of cells and contains the sequence RGD 
(Arg-Gly-Asp). 

10 As used herein, the term "heterologous polynucleotide" means a 

polynucleotide derived from a biological source other than an adenovirus or from 
an adenovirus of a different strain or can be a polynucleotide that is in a different 
locus from wild-type virus. The heterologous polynucleotide can encode a 
polypeptide, such as a toxin or a therapeutic protein. The heterologous 

15 polynucleotide can contain regulatory regions, such as a promoter regions, such 
as a promoter active in specific cells or tissue, for example, tumor tissue as 
found in oncolytic adenoviruses. Alternatively, the heterologous polynucleotide 
can encode a polypeptide and further contain a promoter region operably linked 
to the coding region. 

20 As used herein, reference to an amino acid in an adenovirus protein or to 

a nucleotide in an adenovirus genome is with reference to Ad5, unless specified 
otherwise. Corresponding amino acids and nucleotides in other adenovirus 
strains and modified strains and in vectors can be identified by those of skill in 
the art. Thus recitation of a mutation is intended to encompass all adenovirus 

25 strains that process a corresponding locus. 

As used herein, the KO mutations refer to mutations in fiber that knock 
out binding to CAR. For example, a KOI mutation refers to a mutation in the 
Ad5 fiber and corresponding mutations in other fiber proteins. In Ad5, this 
mutation results in a substitution of fiber amino acids 408 and 409, changing 

30 them from serine and proline to glutamic acid and alanine, respectively. As used 
herein, a K012 mutation refers to a mutation in the Ad5 fiber and corresponding 
mutations in other fiber proteins. In Ad5, this mutation is a four amino acid 
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substitution as follows: R512S, A515G, E516G, and K517G. Other KO 
mutations can be identified empirically or are known to those of skill in the art. 

As used herein, PD mutations refer to mutations in the penton gene that 
ablate binding by the encoded to a v integrin by replacing the RGD tripeptide. 
5 The PD1 mutation exemplified herein results in a substitution of amino acids 337 
through 344 of the Ad5 penton protein, HAIRGDTF (SEQ ID No. 9), with amino 
acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thereby replacing the RGD 
tripeptide. 

As used herein, treatment means any manner in which the symptoms of a 

10 condition, disorder or disease are ameliorated or otherwise beneficially altered. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

15 As used herein, a subject is an animal, such as a mammal, typically a 

human, including patients. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 

20 into the selected target cells in a manner such that the heterologous DNA is 
expressed and a therapeutic product encoded thereby is produced. 
Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 

25 of a therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 

30 receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
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) 

prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, a therapeutic nuucleic acid is a nucleic acid that endes a 
therapeutic product. The product can be nucleic acid, such as a regulatory 
5 sequence or gene, or can encode a protein that has a therapeutic activity or 
effect. For example, therapeutic nucleic acid can be a ribozyme, antisense, 
double-stranded RNA, a nucleic acid encoding a protein and others. 

As used herein, "homologous" means about greater than 25% nucleic 
acid sequence identity, such as 25% 40%, 60%, 70%, 80%, 90% or 95%. If 

10 necessary the percentage homology will be specified. The terms "homology" 

and "identity" are often used interchangeably. In general, sequences are aligned 
so that the highest order match is obtained (see, e.g.: Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic 

15 Press, New York, 1993; Computer Analysis of Sequence Data, Parti, Griffin, 
A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991; Carillo et aL (1988) S/AM J Applied Math 45:1073). 

20 By sequence identity, the number of conserved amino acids are determined by 
standard alignment algorithms programs, and are used with default gap penalties 
established by each supplier. Substantially homologous nucleic acid molecules 
would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid or along at least about 70%, 80% or 90% of the 

25 full-length nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 
nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 

30 "identical" can be determined using known computer algorithms such as the 
"FAST A" program, using for example, the default parameters as in Pearson et 
aL (1988) Proc. Natl. Acad. Sci. USA 55:2444 (other programs include the GCG 
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program package (Devereux, J., et al.. Nucleic Acids Research 72(f):387 
(1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., et al., J Molec Biol 2/5:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San 
Diego, 1994, and Carillo et al. (1988) SI AM J Applied Math 45:1073). For 
5 example, the BLAST function of the National Center for Biotechnology 

Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "MegAlign" program (Madison, Wl) 
and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 
program (Madison Wl)). Percent homology or identity of proteins and/or nucleic 

10 acid molecules can be determined, for example, by comparing sequence 

information using a GAP computer program {e.g., Needleman et al. (1970) J. 
Mol. Biol. 48:443, as revised by Smith and Waterman ((1981) Adv. Appl. Math. 
2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 

15 number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 
of Gribskov et al. (1986) Nucl. Acids Res. 14:6745, as described by Schwartz 
and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 

20 Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 

each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 

25 identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 100 amino acids are 
compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 

30 can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 
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locations of varying length up to the maximum allowable, e.g. 10/100 amino 
acid difference (approximately 90% identity). Differences are defined as nucleic 
acid or amino acid substitutions, or deletions. At the level of homologies or 
identities above about 85-90%, the result should be independent of the program 
5 and gap parameters set; such high levels of identity can be assessed readily, 
often without relying on software. 

As used herein: stringency of hybridization in determining percentage 
mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65 °C 

10 2) medium stringency: 0.2 x SSPE, 0.1 % SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 
Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g., Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring 

15 Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 M NaCI. Further, those of skill in the art recognize that the 
stability of hybrids is determined by T m , which is a function of the sodium ion 
concentration and temperature (T m = 81 .5° C-1 6.6(log 10 [Na + ]) + 0.41(%G + Q- 

20 600/D), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 

25 Shilo and Weinberg, Proc. Natl. Acad. ScL USA 7S:6789-6792 (1981)): Filters 
containing DNA are pretreated for 6 hours at 40° C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1 % 
Ficoll, 1% BSA, and 500 /yg/ml denatured salmon sperm DNA (10X SSC is 1.5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

30 Hybridizations are carried out in the same solution with the following 

modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 /yg/ml salmon sperm 
DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled probe is 
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used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1 .5 hours at 55°C in a solution containing 2X SSC, 25 mM 
Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1.5 hours at 60°C. Filters are 
5 blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68°C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art (e.g., as employed for 
cross-species hybridizations). 

By way of example and not way of limitation, procedures using 

10 conditions of moderate stringency include, for example, but are not limited to, 
procedures using such conditions of moderate stringency are as follows: Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
SSC, 5X Denhart's solution, 0.5% SDS and 100/yg/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 

15 32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55 °C, and then washed twice for 30 minutes at 60°C in a solution 
containing 1X SSC and 0.1 % SDS. Filters are blotted dry and exposed for 
autoradiography. Other conditions of moderate stringency which can be used 
are well-known in the art. Washing of filters is done at 37 °C for 1 hour in a 

20 solution containing 2X SSC, 0.1 % SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 
50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 

25 and 500 jt/g/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours 
at 65 °C in prehybridization mixture containing 100 ^/g/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01 % PVP, 0.01 % Ficoll, 
and 0.01 % BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 

30 minutes before autoradiography. Other conditions of high stringency which can 
be used are well known in the art. 
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The term substantially identical or substantially homologous or similar 
varies with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 85% or 
more preferably at least 90%, and most preferably at least 95% identity. 
5 As used herein, substantially identical to a product means sufficiently 

similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
appear free of readily detectable impurities as determined by standard methods 

10 of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 

high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
not detectably alter the physical and chemical properties, such as enzymatic and 
biological activities, of the substance. Methods for purification of the 

15 compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound can, 
however, be a mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

The methods and and preparation of products provided herein, unless 

20 otherwise indicated, employ conventional techniques of chemistry, molecular 

biology, microbiology, recombinant DNA, genetics, immunology, cell biology, cell 
culture and transgenic biology, which are within the skill of the art (see, e.g., 
Maniatis eta/. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY); Sambrook et al. (1989) 

25 Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY; Ausubel et al. (1992) Current Protocols in Molecular 
Biology, Wiley and Sons, New York; Glover (1985) DNA Cloning I and II, Oxford 
Press; Anand (1992) Techniques for the Analysis of Complex Genomes 
(Academic Press); Guthrie and Fink (1991) Guide to Yeast Genetics and 

30 Molecular Biology, Academic Press; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harobor, NY; 
Jakoby and Pastan, eds. (1979) Cell Culture. Methods in Enzymology 58, 
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Academic Press, Inc., Harcourt Brace Jaovanovich, NY; Nucleic Acid 
Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells 
(R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal (1984), A Practical Guide To Molecular Cloning) Gene 
5 Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, 
Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 
(Wu et al. eds.); Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental 
Immunology, Volumes l-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Hogan 

10 et al. (1986) Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. 
B. Capsid modifications 

Provided herein are modifications of the viral capsid that ablate the 
interaction of an adenovirus with its natural receptors. In particular, fiber 

15 modifications that result in ablation of the interaction of an adenvirus with HSP 
are provided. These fiber modifications can be combined with other capsid 
protein modifications, such as other fiber modifications and/or penton and/or 
hexon modifications, to fully ablate viral interactions with natural receptors, 
when expressed on a viral particle. The modification should not disrupt trimer 

20 formation or transport of fiber into the nucleus. 
1 . Fiber genes and proteins 

The fiber protein extends from the capsid and mediates viral binding to 

the cell surface by binding to specific cell receptors (Philipson et al. (1968) J. 

Virol. 2:1064-1075). The fiber is a trimeric protein that includes an N-terminal 
25 tail domain that interacts with the adenovirus penton base, a central shaft 

domain of varying length, and a C-terminal knob domain that contains the cell 

receptor binding site (Chroboczek et al. (1995) Curr. Top. Microbiol. Immunol. 

735:163-200; Riurok et al. (1990) J.Mol.Biol. 2/5:589-596; Stevenson et al. 

(1995) J. Virol. 63:2850-2857). The sequences of the fiber gene from a variety 
30 of serotypes including adenovirus serotypes 2 (Ad2), Ad5, Ad3, Ad35, Ad12, 

Ad40, and Ad41 are known. There are at least 21 different fiber genes in 

Genbank. 



WO 2003/062400 



PCT/US2003/002295 



-28- 

As noted, the fiber protein can be divided into three domains (see, e.g., 
Green et al. (1983) EMBO J. 2:1357-1365). The conserved N-terminus contains 
the sequences responsible for association with the penton base as well as a 
nuclear localization signal. A rod-like shaft of variable length contains repeats of 
5 a 1 5 amino acid beta structure, with the number of repeats ranging from 6 in 
Ad3 to 22 in Ad5. A conserved stretch of amino acids which includes the 
sequence TLWT (SEQ ID No. 36) marks the boundary between the repeating 
units of beta structure in the shaft and the globular head domain. The C-terminal 
head domain ranges in size from 157 amino acid residues for the short fiber of 

10 Ad41 to 193 residues for Ad1 1 and Ad34. The fiber spike is a homotrimer and 
it is thought that the C-terminus is responsible for trimerization of the fiber 
homotrimer and there are 12 spikes per virion which are attached via association 
with the penton base complex. 

2. Modification of HSP interaction 

15 The adenovirus fiber protein is a major determinant of adenovirus tropism 

(Gall et al. (1 996) J. Virol. 70:2116-2123; Stevenson et al. (1 995) J. Virol. 
63:2850-2857). Dogma in the field has been that adenoviral entry occurs via 
binding to CAR and integrins. This is underscored by published data (Einfeld et 
al. (2001 ) J. Virology 75'A 1 284-1 1 291 ). It is shown herein, however, these 

20 published entry pathways are not the predominant ones that act in vivo. 

Moreover, as shown herein, the dominant entry pathway for hepatocytes in vivo 
involves a mechanism mediated by the fiber shaft, such as Ad5 shaft, through 
heparin sulfate proteoglycans binding. 

It is shown herein that elimination of this binding eliminates entry vis HSP 

25 binding, such as in hepatocytes. Adenoviral fiber shaft modifications that ablate 
viral interaction with HSP are provided. Thus, as provided herein, efficient 
detargeting of adenovirus in vivo can be achieved with appropriately designed 
fiber proteins. Suitable modifications, such as described herein, can be made 
with respect to any adenovirus in which the wild-type interacts with HSP. 

30 As provided herein, the ability of an adenoviral vector to interact with 

HSP is modified. In particular, the ability to interact is reduced or eliminated. 
Modifications include insertions, deletions, individual amino acid mutations and 
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other mutations that alter the structure of the fiber shaft such that the HSP 
binding of the modified fiber protein is ablated when compared to the HSP 
binding of the wild-type fiber protein. 

In a first aspect of this embodiment, an adenoviral fiber protein is 
5 modified by mutating one or more of the amino acids that interact with HSP. 

For example, the HSP binding motif of the modified fiber protein is no longer able 
to interact with HSP on the cell surface, thus ablating the viral interaction with 
HSP. For example, the adenoviral fiber is from a subgroup C adenovirus. 
Binding to HSP can be eliminated or reduced by mutating the fiber shaft in order 

10 to modify the ability of the HSP binding motif, which is, for example, KKTK 

sequence (SEQ ID No. 1) located between amino acid residues 91 to 94 in the 
Ad 5 fiber, to interact with HSP. The fiber proteins are modified by chemical 
and biological techniques known to those skilled in the art, such as site directed 
mutagenesis of nucleic aicd encoding the fiber or other techniques as illustrated 

15 herein. 

In another aspect of this embodiment, the ability of a fiber to interact 
with HSP is modified by replacing the wild-type fiber shaft with a fiber shaft, or 
portion thereof, of an adenovirus that does not interact with HSP to produce 
chimeric fiber proteins. The portion is sufficient to reduce or eliminate 

20 interaction with HSP. Examples of adenoviruses having fiber shafts that do not 
interact with HSP include (a) adenoviruses of subgroup B, such as, but are not 
limited to, Ad3, Ad35, Ad7, Ad11, Ad16, Ad21, Ad34, which do not have 
interaction with HSP, (b) adenoviruses of subgroup F, such as, but are not 
limited to, Ad40 and Ad41, specifically the short fiber, and (c) adenoviruses of 

25 subgroup D, such as but are not limited to, Ad46. In another embodiment, 
adenoviral fiber shaft modifications that ablate viral interaction with HSP in 
combination with adenoviral fiber knob modifications that ablate viral interactions 
with CAR are provided. Suitable adenoviral fiber modifications include the 
fiber knob modifications are known to those of skill in the art and are exemplified 

30 herein (see, also, US. Patent Application Serial No. 09/870,203, filed on 30 May 
2001, and published as U.S. Published application No. 20020137213, in 
International Patent Application No. PCT/EP01 /06286 filed on 1 June 2001). 



WO 2003/062400 



PCT/US2003/002295 



-30- 

Modifications of the fiber include mutations of at least one amino acid in the CD 
loop of a wild-type fiber protein of an adenovirus from subgroup C, D, or E, or 
the long wild-type fiber of an adenovirus from subgroup F, whereby the ability of 
a fiber protein to bind to CAR is reduced or substantially eliminated. The fiber 
5 proteins with ablated CAR interaction are modified by chemical and biological 
techniques known to those skilled in the art, as illustrated herein and as 
described in the above patent application. 

Alternatively, adenoviral fiber modifications are made by replacing the 
wild-type fiber knob with a fiber knob of an adenovirus that does not interact 

10 with CAR. The fiber protein also will be selected so that it does not interact 
with HSP. Examples of adenoviruses having fiber knobs that do not interact 
with CAR include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Ad1 1, 
Ad16, Ad21, Ad34, (b) adenoviruses of subgroup F, e.g., Ad40 and Ad41, 
specifically the short fiber. 

15 In another embodiment, adenoviral fiber shaft modifications that ablate 

viral interaction with HSP in combination with penton modifications that ablate 
viral interactions with o v integrins are provided. Suitable adenoviral penton 
modifications include the penton modifications, which are well known to those^ 
of skill in the art (see, e.g., U.S. Patent No. 5,731,190; see, also Einfeld et al. 

20 (2001 ) J. Virology 75'A 1 284-1 1 291 ; and Bai et al. (1 993) J. Virology 
57:5198-5205). 

For example, penton interaction with a v integrins can be ablated (reduced 
or eliminated) by substitution of the RGD tripeptide motif, required for a v 
interaction, in penton with a different tripeptide that does not interact with an a v 

25 integrin. The penton proteins with ablated a v integrin interactions are modified 
by chemical and biological techniques known to those skilled in the art (see, 
e.g., described U.S. Patent No. 6,731,190 and as illustrated herein). Generally, 
the adenovirus is a subgroup B or C adenovirus. 

Also provided are adenoviral fiber shaft modifications that ablate viral 

30 interaction with HSP in combination with adenoviral fiber knob modifications that 
ablate viral interactions with CAR and with penton modifications that ablate viral 
interactions with a v integrins. These modifications are described above and 
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prepared using chemical and biological techniques known to those skilled in the 
art and as illustrated herein. Generally the adenovirus is a subgroup B or 
subgroup C adenovirus. 

Preparation of fibers modified to eliminate or reduce HSP interactions and 
5 fibers modified to alter interactions with other receptors and cell surface 
proteins, such as CAR and/or a v integrin, is also described in the Examples 
below. The nucleic acid and/or amino acid sequences of exemplary modified 
fibers, whose construction are described below) are set forth as SEQ ID Nos. 45- 
72 as follows: 

10 SEQ ID Nos. 45 and 46 set forth the encoding nucleotide sequence and 

amino acid sequence of the modified fiber designated 5FK01, where 5F refers to 
Adenovirus 5 fiber, KOI is an exemplary mutation of the CAR interaction site 
described herein; 

SEQ ID Nos. 47 and 48 set forth the encoding nucleotide sequence and 
15 amino acid sequence of the modified ber designated 5FK01RGD, which further 
includes an RGD ligand to demonstrate retargeting; 

SEQ ID Nos. 49 and 50 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FKQ12, where 5F refers 
to Adenovirus 5 fiber, K012 is another exemplary mutation of the CAR 
20 interaction site described herein; 

SEQ ID Nos. 51 and 52 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S* nuc, where 5F 
refers to Adenovirus 5 fiber, S* is an exemplary mutation of the shaft that alters 
binding to HSP; 

25 SEQ ID Nos. 53 and 54 set forth the encoding nucleotide sequence and 

amino acid sequence of the modified fiber designated 5F S*RGD nuc, which 
further includes an RGD ligand; 

SEQ ID Nos. 55 and 56 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01S*, which contain 

30 the KQ1 and S* mutations; 
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SEQ ID Nos. 57 and 58 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK01S*RGD, which 
further includes an RGD ligand; 

SEQ ID Nos. 59 and 60 set forth the encoding nucleotide sequence and 
5 amino acid sequence of a Ad35 fiber; 

SEQ ID Nos. 61 and 62 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35FRGD, which is 35F 
fiber with an RGD ligand; 

SEQ ID Nos. 63 and 64 set forth the encoding nucleotide sequence and 
10 amino acid sequence of a Ad41 short fiber; 

SEQ ID Nos. 65 and 66 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 41 sFRGD, which is 41 F 
short fiber with an RGD ligand; 

SEQ ID Nos. 67 and 68 set forth the encoding nucleotide sequence and 
15 amino acid sequence of Ad5 penton; 

SEQ ID Nos. 69 and 70 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5TS35H, which is a 
chimeric fiber in which an Ad5 fiber tail and shaft regions (5TS; amino acids 1 to 
403) are connected to an Ad35 fiber head region (35H; amino acids 137 to 323) 
20 to form the 5TS35H chimera; and 

SEQ ID Nos. 71 and 72 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35TS5H, which is a 
chimeric fiber in which an Ad35 fiber tail and shaft regions (35TS; amino acids 1 
to 136) are connected to an Ad5 fiber head region (5H; amino acids 404 to 581) 
25 to form the 35TS5H chimera. 

SEQ ID No. 1 sets forth the nucleotide sequence of Ad fiber; SEQ ID Nos. 
2 and 3 also set forth the coding nucleic acid sequencs for fibers with modified 
fiber knobs for ablated CAR interaction (see, SEQ ID No. 2 for K01 and SEQ ID 
No. 3 for K012); SEQ ID No. 4 also sets for the encoding nucleic acid sequence 
30 of a modified penton for ablated a v integrins (SEQ ID No. 4). 

The modified fibers are displayed on virus particles by modifying the fiber 
protein and optionally additional proteins. This can be achieved by preparing 



WO 2003/062400 



PCT/US2003/002295 



-33- 

adenoviral vectors that express the modified capsid proteins and produce 

particles with modified fibers, or by packaging adenoviral vectors, particularly 

those that do not encode one or more capsid proteins in appropriate packaging 

lines. Hence, as discussed in detail below, adenoviral vectors and viral particles 

5 with modified fibers that do no bind to HSP are provided. 

C. Nucleic acids. Adenoviral vectors and cells containing the nucleic acids 
and cells containing the vectors 

Also provided are polynucleotides that encode modified capsid proteins 
and that encode vectors for preparation of adenovirus that express modified 

10 capsid proteins provided herein. The sequences of the wild-type adenovirus 

proteins are well known in the art and are modified as described herein. Nucleic 
acid molecules, such as cDNA that encode an exemplary modified fiber knob for 
ablated CAR interaction (see, SEQ ID No. 2 for KOI and SEQ ID No. 3 for K012) 
and for a modified penton for ablated a v integrins (SEQ ID No. 4) are provided. 

15 As discussed above, modified capsid proteins with altered tropism for CAR and 
a v integrins are known and described in the patents, applications and literature 
cited herein and known to those of skill in the art (see, e.g., U.S. Patent No. 
5,731,190, U.S. application Serial No. 09/870,203, published as U.S. Published 
application No. 20020137213; and Bai et al. (1993) J. Virology 

20 67:5198-5208). 

Also provided are vectors including the polynucleotides provided herein. 
Such vectors include partial or complete adenoviral genomes and plasmids. Such 
vectors are constructed by techniques known to those skilled in the art and as 
illustrated herein. Also provided are adenoviral vectors modified by replacing 

25 whole fiber protein, or portions thereof, with the fiber proteins, or appropriate 
portions thereof, of an adenovirus that does not interact with HSP. 
Adenoviruses that do not interact with HSP can be identified by using the 
methods described herein which detect binding or non-binding of fiber proteins 
and adenoviruses with HSP. Among the adenoviral vectors provided herein are 

30 those of subgroup C, which include Ad2 and Ad5, in which the nucleic acid 
encoding the fiber shaft or a portion including the HSP-binding portion is 
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replaced with nucleic acid encoding fiber or an appropriate portion thereof from a 
serotype, such as Ad35. 

Adenoviral fiber modifications, thus, can be made in viral particles by 
replacing the entire fiber protein with the fiber protein of an adenovirus that does 
5 not interact with CAR and/or replacing the HSP binding portion with a portion 
that does not bind. Generally the adenovirus is a subgroup B or subgroup C 
adenovirus, and also an adenovirus of subgroup D, such as Ad46. Adenoviral 
vectors of subgroup C, such as Ad2 and Ad5, having a replaced fiber knob are 
prepared using techniques well known in the art and as illustrated herein. 

10 1 . Preparation of viral particles 

The packaging cells used to produce the viruses provided herein contain 
the nucleic acid encoding the capsid protein, including the mutated fiber protein 
provided herein. Such nucleic acid can be transfected into the cell, generally 
part of as part of plasmid, or it can be infected into the cell with a viral vector. It 

15 can be stably incorporated into the genome of the cell, thus providing for a 

stable cell line. Alternatively, nucleic acid encoding the mutated capsid protein 
can be removed from the genome, in which case a transient complementing cell 
is employed. 

The adenovirus genome to be packaged is transferred into the ( 
20 complementing cell by techniques known to those skilled in the art. These 

techniques include transfection or infection with the adenovirus. The nucleic acid 
encoding the mutated fiber protein can be in this genome instead of in the 
packaging cell. 

In certain cases, when the nucleic acid encoding the mutated fiber is in 
25 the genome to be packaged, it can be desirable for the packaging cell to also 
encode a fiber protein. Such protein can assist in the maturation and packaging 
of an infectious particle. Such protein can be a wild-type fiber protein or one 
modified such that it is unable to attach to the penton base protein and is for 
use, for example, in producer cells where the fiber is included to provide the 
30 packaging function and the vector encodes a full-length fiber. 

The packaging cells are cultured under conditions that permit the 
production of the desired viral particle. The viral particles are recovered by 
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standard techniques. An exemplary method for producing adenoviral particles 
provided herein is as follows. The nucleic acid encoding the mutated fiber 
protein is made using standard techniques in an adenoviral shuttle plasmid. This 
plasmid contains the right end of the virus, in particular from the end of the E3 
5 region through the right ITR. This plasmid is co-transfected into competent cells 
of an E. coli strain, such as the well known E. coli strain BJ5183 (see, e.g., 
Degryse (1996) Gene 1 70:45-50) along with a plasmid, which contains the 
remaining portion of the adenovirus genome, except for the E1 region and 
sometimes also the E2a region and also contains a corresponding region of 
10 homology. Homologous recombination between the two plasmids generates a 
full-length plasmid encoding the entire adenoviral vector genome. 

This full-length adenoviral vector genome plasmid is then transfected into 
a complementing cell line. The transfection can be performed in the presence of 
a reagent that directs adenoviral particle entry into producer cells. Such 
15 reagents include, but are not limited to, polycations and bifunctional reagents, 
such as those described herein. A complementing cell is, for example, is a cell 
of the PER.C6 cell line, which contains the adenoviral E1 gene (PER.C6 is 
available, for example, from Crucell, The Netherlands; deposited under ECACC 
accession no. 96022940; see, also Fallaux et al. (1998) Hum. Gene Ther. 
20 3:1909-1907; see, also, U.S. Patent No. 5,994,128) or an AE1-2a cell (see, 
Gorziglia et al. (1996) J. Virology 70:4173-4178; and and Von Seggern et al. 
(1998) J, Gen. Virol. 73:1461-1468)). 

AE1-2a cells are derivatives of the A549 lung carcinoma line (ATCC # 
CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 
25 referred to as GRE5-E1 -SV40-Hygro construct and listed in SEQ ID No. 41) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 42), which provide complementation of the adenoviral E1 and E2a 
functions, respectively. 

The 633 cell line (see, von Seggern et al. (2000) J. Virology 
30 74:354-362), which stably expresses the adenovirus serotype 5 wild-type fiber 
protein, and was derived from the AE1-2a cell line, is another an example of 
complementing cells. When the cell line is 633 cells, the final passage of 
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adenoviral vector is performed on another complementing cell line (e.g., Per.C6), 
which does not express wild-type Ad5 fiber. 

Jhe transfected complementing cells are maintained under standard cell 
culture conditions. The adenoviral plasmids recombine to form the adenoviral 
5 genome that is packaged. The particles are infectious, but replication deficient 
because their genome is missing at least the E1 genes. When performed in the 
633 cells the particles contain wild-type and mutated fiber proteins. They are 
recovered from the crude viral lysate, amplified, and are purified by standard 
techniques. 

10 The recovered particles can be used to infect PER.C6 or AE1-2a cells. 

This permits the recovery of particles whose capsids contain only the desired 
mutated fiber. This two-step procedure provides high titer batches of the 
adenoviral particles provided herein. The adenoviral particles can be replication 
competent or replication incompetent. 

15 In one embodiment, the particles selectively replicate in certain 

predetermined target tissue but are replication incompetent in other cells and 
tissues. In a particular embodiment, the adenoviral particles replicate in 
abnormally proliferating tissue, such as solid tumors and other neoplasms. In 
replication conditional adenoviruses, a gene essential for replication is placed 

20 under control of a heterologous promoter which is cell or tissue specific. For 

example, the E1a gene is placed under control of a promoter which is active in a 
tumor cell to produce an oncolytic adenovirus or oncolytic adenoviral vector. 
Administration of oncolytic adenoviral vectors to tumor cells kills the tumor cells. 
Such replication conditional adenoviral particles and vectors can be produced by 

25 techniques known to those skilled in the art, such as those disclosed in the 

above-referenced U.S. Patent Nos. 5,998,205 and 5,801,029. These particles 
and vectors can be produced in adenoviral packaging cells as disclosed above. 
Generally packaging cells are those that have been designed to limit homologous 
recombination that could lead to wild-type adenoviral particles. Such cells are 

30 well known and include the packaging cell known as PER.C6 (see, e.£r.,U.S. 

Patent Nos. 5,994,128 and 6,033,908; deposited under ECACC accession no. 
96022940). Since oncolytic vectors are replication competent in certain cell 
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types, they can be amplified in cell lines derived from said cell type without 
provision of Ad complementary genes. 

2. Adenoviral vectors and particles 

The adenovirus as used herein for production of the adenoviral vectors 
5 and particles can be of any serotype. Adenoviral stocks that can be employed 
as a source of adenovirus or adenoviral coat protein, such as fiber and/or penton 
base, can be amplified from the adenoviral serotypes 1 through 47, which are 
currently available from the American Type Culture Collection (ATCC, Rockville, 
Md.), or from any other serotype of adenovirus available from any other source. 
0 For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, 31), 
subgroup B (e.g., serotypes 3, 7, 1 1 , 14, 1 6, 21 , 34, 35), subgroup C (e.g., 
serotypes 1, 2, 5, 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 
22-30, 32, 33, 36-39, 42-47), subgroup E (serotype 4), subgroup F (serotype 
40, 41), or any other adenoviral serotype. 
5 In certain embodiments, the adenovirus is a subgroup B or a subgroup C 

adenovirus. Subgroup C adenoviruses which are modified in as described herein, 
include, but are not limited to, Ad2 and Ad5. For Ad5, the mutation is made in 
the KKTK sequence (SEQ ID No. 1) located between amino acid residues 91 to 
94. The fiber proteins can be modified by chemical and biological techniques 
20 known to those skilled in the art. These methods include, but are not limited to, 
site directed mutagenesis and techniques as illustrated herein. 

The adenoviral particle generally includes a targeting ligand as described 
above. The presence of the targeting ligand permits the delivery of a gene to a 
desired cell type which is different from the cell type that wild-type adenovirus 
25 particles infect or the same as that a wild-type particle infects, but allowing the 
infection in a selective manner, i.e., non-target cell types are not significantly 
infected. 

The adenoviral vectors provided herein can be used to study cell 
transduction and gene expression in vitro or in various animal models. The latter 
30 case includes ex vivo techniques, in which cells are transduced in vitro and then 
administered to the animal. They also can be used to conduct gene therapy on 
humans or other animals. Such gene therapy can be ex vivo or in vivo. For in 
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vivo gene therapy, the adenoviral particles in a pharmaceutically-acceptable 
carrier are delivered to a human in a therapeutically effective amount in order to 
prevent, treat, or ameliorate a disease or other medical condition in the human 
through the introduction of a heterologous gene that encodes a therapeutic 
5 protein into cells in such human. The adenoviruses are delivered at a dose 
ranging from approximately 1 particle per kilogram of body weight to 
approximately 10 14 particles per kilogram of body weight. Generally, they are 
delivered at a dose of approximately 1 0 6 particles per kilogram of body weight to 
approximately 10 13 particles per kilogram of body weight, and typically the dose 

10 ranges from approximately 10 s particles per kilogram of body weight to 
approximately 10 12 particles per kilogram of body weight. 

Any vectors known to those of skill in the art can be employed and used 
to produce viral particles that include fibers modified to ablate (including reduce) 
binding to HSP. Some exemplary vectors are as follows. 

15 a. Gutless vectors 

Gutted adenovirus vectors are those from which most or all viral genes 
have been deleted. They are grown by co-infection of the producing cells with a 
"helper" virus (such as using an El-deleted Ad vector), where the packaging cells 
expresses the E1 gene products. The helper virus trans- complements the 

20 missing Ad functions, including production of the viral structural proteins needed 
for particle assembly. To incorporate the capsid modifications into a gutted 
adenoviral vector capsid, the changes must be made to the helper virus as 
described herein. All the necessary Ad proteins including the modified capsid 
protein are provided by the modified helper virus, and the gutted adenovirus 

25 particles are equipped with the particular modified capsid expressed by the host 
cells. The E1a, Eb, E2a, E2b and E4 are generally required for viral replication 
and packaging. If these genes are deleted, then the packaging cell must provide 
these genes or functional equivalents. 

A helper adenovirus vector genome and a gutless adenoviral vector 

30 genome are delivered to packaging cells. The cells are maintained under 

standard cell maintenance or growth conditions, whereby the helper vector 
genome and the packaging cell together provide the complementing proteins for 
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the packaging of the adenoviral vector particle. Such gutless adenoviral vector 
particles are recovered by standard techniques. The helper vector genome can 
be delivered in the form of a plasmid or similar construct by standard 
transfection techniques, or it can be delivered through infection by a viral particle 
5 containing the genome. Such viral particle is commonly called a helper virus. 
Similarly, the gutless adenoviral vector genome can be delivered to the cell by 
transfection or viral infection. 

jhe helper virus genome can be the modified adenovirus vector genome 
as disclosed herein. Such genome also can be prepared or designed so that it 
0 lacks the genes encoding the adenovirus E1A and E1B proteins. In addition, the 
genome can further lack the adenovirus genes encoding the adenovirus E3 
proteins. Alternatively, the genes encoding such proteins can be present but 
mutated so that they do not encode functional E1A, E1B and E3 proteins. 
Furthermore, such vector genome can not encode other functional early proteins, 
5 such as E2A, E2B3, and E4 proteins. Alternatively, the genes encoding such 
other early proteins can be present but mutated so that they do not encode 
functional proteins. 

In producing the gutless vectors, the helper virus genome is also 
packaged, thereby producing helper virus. In order the minimize the amount of 
20 helper virus produced and maximize the amount pi gutless vector particles 

produced, the packaging sequence in the helper virus genome can be deleted or 
otherwise modified so that packaging of the helper virus genome is prevented or 
limited. Since the gutless vector genome will have an unmodified packaging 
sequence, it will be preferentially packaged. 
25 One way to do this is to mutate the packaging sequence by deleting one 

or more of the nucleotides comprising the sequence or otherwise mutating the 
sequence to inactivate or hamper the packaging function. One exemplary 
approach is to engineer the helper genome so that recombinase target sites flank 
the packaging sequence and to provide a recombinase in the packaging cell. The 
30 action of recombinase on such sites results in the removal of the packaging 

sequence from the helper virus genome. The recombinase can be provided by a 
nucleotide sequence in the packaging cell that encodes the recombinase. Such 
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sequence can be stably integrated into the genome of the packaging cell. 
Various kinds of recombinase are known by those skilled in the art, and include, 
but are not limited to, Cre recombinase, which operates on so-called lox sites, 
which are engineered on either side of the packaging sequence as discussed 
5 above (see, e.g., U.S. Patent Nos. 5,919, 676, 6,080,569 and 5,919,676; see, 
also, e.g., Morsy and Caskey, Molecular Medicine Today, Jan. 1999, pgs. 
18-24). 

An example of a gutless vector is pAdARSVDys (Haecker et al. (1996) 
Hum Gene Then 7:1907-1914)). This plasmid contains a full-length human 

10 dystrophin cDNA driven by the RSV promoter and flanked by Ad inverted 
terminal repeats and packaging signals. 293 cells are infected with a 
first-generation Ad, which serves as a helper virus, and then transfected with 
purified pAdARSVDys DNA. The helper Ad genome and the pAdARSVDys DNA 
are replicated as Ad chromosomes, and packaged into particles using the viral 

15 proteins produced by the helper virus. Particles are isolated and the 

pAdARSVDys-containing particles separated from the helper by virtue of their 
smaller genome size and therefore different density on CsCI gradients. Other 
examples of gutless adenoviral vectors are known (see, e.g., Sandig et al. 
(2000) Proc. Natl. Acad. Sci. U.S.A. 57^:1002-7). 

20 b. Oncolytic vectors 

Briefly, oncolytic adenoviruses, which are viruses that replicate selectively 
in tumor cells, are designed to amplify the input virus dose due to viral 
replication in the tumor, leading to spread of the virus throughout the tumor 
mass. In situ replication of adenoviruses leads to cell lysis. This in situ replication 

25 permits relatively low, non-toxic doses to be highly effective in the selective 

elimination of tumor cells. One approach to achieving selectivity is to introduce 
loss-of-function mutations in viral genes that are essential for growth in 
non-target cells but not in tumor cells. (See, e.g., U.S. Patent No. 5,801,029.) 
This strategy is exemplified by the use of Addl1520, which has a deletion in the 

30 E1b-55KD gene. In normal cells, the adenoviral E1b-55KD protein is needed to 
bind to p53 to prevent apoptosis. In p53-deficient tumor cells, E1b-55K binding 
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to p53 is unnecessary. Thus, deletion of E1b-55KD should restrict vector 
replication to p53-deficient tumor cells. 

Another approach is to use tumor-selective promoters to control the 
expression of early viral genes required for replication (see, e.g., International 
5 PCT application Nos. WO 96/17053 and WO 99/25860). Thus, in this approach 
the adenoviruses selectively replicate and lyse tumor cells if the gene that is 
essential for replication is under the control of a promoter or other transcriptional 
regulatory element that is tumor-selective. 

For example oncolytic adenoviral vectors that contain a cancer selective 

10 regulatory region operatively linked to an adenoviral gene essential for adenoviral 
replication are known (see, e.g., U.S. Patent No. 5,998,205). Adenoviral genes 
essential for replication include, but are not limited to, E1a, E1b, E2a, E2b and 
E4. For example, ah exemplary oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the E1a gene. In other 

15 embodiments, the oncolytic adenoviral vector has a cancer selective regulatory 
region of the present invention operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
also can include at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 

20 systemic immune response against tumor cells. 

Other exemplary oncolytic adenoviral vectors include those in which 
expression of an adenoviral gene, which is essential for replication, is controlled 
by E2F-responsive promoters, which are selectively transactivated in cancer 
cells. Thus, vectors that contains an adenoviral nucleic acid backbone that 

25 contains in sequential order: A left ITR, an adenoviral packaging signal, a 
termination signal sequence, an E2F responsive promoter which is operably 
linked to a first gene, such as E1a, essential for replication of the recombinant 
viral vector and a right ITR (see, published International PCT application No. 
WO02/06786, and U.S. Patent No. 5,998,205). 

30 In other embodiments, the oncolytic adenoviral vector has a cancer 

selective regulatory region operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
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can also carry at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 
systemic immune response against tumor cells. 
3. Packaging 

5 The viral particles provided herein can be made by any method known to 

those of skill in the art. Generally they are prepared by growing the adenovirus 
vector that contains nucleic acid that encodes the modified fiber protein in a 
standard adenovirus packaging cells to produce particles that express the 
modified fibers. Alternatively, the vectors do not encode fibers. Such vectors 
10 are packaged in producer cells to produce particles that express the modified 
fiber proteins. 

As discussed, recombinant adenoviral vectors generally have at least a 
deletion in the first viral early gene region, referred to as E1, which includes the 
E1a and E1b regions. Deletion of the viral E1 region renders the recombinant 

15 adenovirus defective for replication and incapable of producing infectious viral 
particles in subsequently-infected target cells. Thus, to generate E1 -deleted 
adenovirus genome replication and to produce virus particles requires a system 
of complementation which provides the missing E1 gene product. E1 
complementation is typically provided by a cell line expressing El, such as the 

20 human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham et aL, J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 
that may be usable for production of defective adenovirus having a portion of 

25 the adenovirus E4 region have been reported (WO 96/22378). Multiply deficient 
adenoviral vectors and complementing cell lines have also been described (WO 
95/34671, U.S. Patent No. 5,994,106). 

For example, copending U.S. application Serial No. 09/482,682 (also 
filed as International PCT application No. PCT/EPOO/00265, filed January 14, 

30 200, published as International PCT application No. WO/0042208) provides 

packaging cell lines that support viral vectors with deletions of major portions of 
the viral genome, without the need for helper viruses and also provides cell lines 
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and helper viruses for use with helper-dependent vectors. The packaging cell 
line has heterologous DNA stably integrated into the chromosomes of the cellular 
genome. The heterologous DNA sequence encodes one or more adenovirus 
regulatory and/or structural polypeptides that complement the genes deleted or 
5 mutated in the adenovirus vector genome to be replicated and packaged. 
Packaging cell lines express, for example, one or more adenovirus structural 
proteins, polypeptides, or fragments thereof, such as penton base, hexon, fiber, 
polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, polypeptide VIII, 
and biologically active fragments thereof. The expression can be constitutive or 

10 under the control of a regulatable promoter. These cell lines are particularly 
designed for expression of recombinant adenoviruses intended for delivery of 
therapeutic products. For use herein, such packaging cell lines can express the 
modified capsid proteins, such as the fiber proteins who binding to HSP is 
reduced or eliminated, and/or the modified penton and hexon proteins. 

15 Particular packaging cell lines complement viral vectors having a deletion 

or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides E1A and E1B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 
The packaging cell lines are produced by introducing each DNA molecule 

20 into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 

25 such as Ad 37, polypeptide or fragment thereof. ' 

For applications, such as therapeutic applications, the delivery plasmid 
further can include a nucleotide sequence encoding a heterologous polypeptide. 
Exemplary delivery plasmids include, but are not limited to, pDV44, pAE1B£-gal 
and pAE1sp1B. In a similar or analogous manner, therapeutic nucleic acids, 

30 such as nucleic acids that encode therapeutic genes, can be introduced. 



WO 2003/062400 



PCT/US2003/002295 



-44- 

The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
chromosome(s) of the cellular genome of the cell. 

Typically, the packaging cell lines will contain nucleic acid encoding the 
5 fiber protein or modified protein stably integrated into a chromosome or 

chromosomes in the cellular genome. The packaging cell line can be derived from 
a procaryotic cell line or from a eukaryotic cell line. While various embodiments 
suggest the use of mammalian cells, and more particularly, epithelial cell lines, a 
variety of other, non-epithelial cell lines are used in various embodiments. Thus, 

10 while various embodiments disclose the use of a cell line selected from among 
the 293, A549, W1 62, HeLa, Vero, 211, and 21 1 A cell lines, and any other cell 
lines suitable for such use are likewise contemplated herein. 
D. Addition of a targeting ligand 

The viral particles that are detargeted as described herein, can be 

15 retargeted to selected cells and/or tissues by inclusion of an appropriate 

targeting ligand in the capsid. The ligand cam be included in any of the capsid 
proteins, such as fiber, hexon and penton. Loci for inclusion of nucleic acid 
encoding a is known to those of skill in the art for a a variety of adenovirus 
serotypes; if necessary appropriate loci and other parameters can be empirically 

20 determined. 

The ligand can be produced as a fusion by inclusion of the coding 
sequences in the nucleic acid encoding a capsid protein, or chemically 
conjugated, such as via ionic, covalent or other interactions, to the capsid or 
bound to the capsid {e.g., by Ab-ligand fusion, where Ab binds capsid protein; or 

25 by disulfide bonding or other crosslinking moieties or chemistries). 

Thus, for example, a modified fiber nucleic acid also can include 
sequences of nucleotides that encode a targeting ligand to produce viral particles 
that include a targeting ligand in the capsid. Targeting ligand and methods for 
including such ligands in viral capids are well known. For example, inclusion of 

30 targeting ligands in fiber proteins is described in U.S. Patent Nos. 5,543,328 and 
5,756,086 and in U.S. Patent Application Serial No. 09/870,203, published as 
U.S. Published application No. 20020137213, and International Patent 
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Application No. PCT/EP01 /06286. For different serotypes and strains of 
adenoviruses, loci for insertion of targeting ligands can be empirically 
determined. For different serotypes and strains, such loci can vary. 

Because the adenovirus fiber has a trimeric structure, the ligand can be 
5 selected or designed to have a trimeric structure so that up to three molecules of 
the ligand are present for each mature fiber. Such ligands can be incorporated 
into the fiber protein using methods known in the art (see, e.g., U.S. Patent No. 
5,756,086). Instead of the fiber, the targeting ligand can be included in the 
penton or hexon proteins. Inclusion of targeting ligands in penton (see for 

10 example, in U.S. Patent Nos. 5,731,190 and 5,965,431) and in hexon (see for 
example, in U.S. Patent No. 5,965,541) is known. 

In one exemplary embodiment, the ligand is included in a fiber protein, 
which is a fiber protein mutated as described herein. As shown herein, the 
targeting ligand can be included, for example, within the HI loop of the fiber 

15 protein. Any ligand that can fit in the Hi loop and still provide a functional virus 
is contemplated herein. Such ligands can be as long as or longer than 80-100 
amino acids (see, e.g., Belousova et al. (2002) J. Virol. 75:8621-8631). Such 
ligands are added by techniques known in the art (see, e.g., published 
International Patent Application publication No. W099/39734 and U.S. Patent 

20 Application number 09/482,682). Other ligands can be be discovered through 
techniques known to those skilled in the art. Some non-limiting examples of 
these techniques include phage display libraries or by screening other types of 
libraries. 

Targeting ligands include any chemical moiety that preferentially directs 
25 an adenoviral particle to a desired cell type and/or tissue. The categories of such 
ligands include, but are not limited to, peptides, polypeptides, single chain 
antibodies, and multimeric proteins. Specific ligands include the TNF superfamily 
of ligands which include tumor necrosis factors (or TNF's) such as, for example, 
TNFa and TNF/?, lymphotoxins (LT), such as LT-a and LT-£, Fas ligand which 
30 binds to Fas antigen; CD40 ligand, which binds to the CD40 receptor of 

B-lymphocytes; CD30 ligand, which binds to the CD30 receptor of neoplastic 
cells of Hodgkin's lymphoma; CD27 ligand, NGF ligand, and OX-40 ligand; 
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transferrin, which binds to the transferrin receptor located on tumor cells, 
activated T -cells, and neural tissue cells; ApoB, which binds to the LDL receptor 
of liver cells; alpha-2-macroglobulin, which binds to the LRP receptor of liver 
cells; alpha-l acid glycoprotein, which binds to the asialoglycoprotein receptor of 
5 liver; mannose-containing peptides, which bind to the mannose receptor of 
macrophages; sialyl-Lewis-X antigen-containing peptides, which bind to the 
ELAM-I receptor of activated endothelial cells; CD34 ligand, which binds to the 
CD34 receptor of hematopoietic progenitor cells; ICAM-I, which binds to the 
LFA-I (CD11b/CD18) receptor of lymphocytes, or to the Mac-I (CD11a/CD18) 

10 receptor of macrophages; M-CSF, which binds to the c-fms receptor of spleen 
and bone marrow macrophages; circumsporozoite protein, which binds to 
hepatic Plasmodium falciparum receptor of liver cells; VLA-4, which binds to the 
VCAM-I receptor of activated endothelial cells; HIV gp120 and Class II MHC 
antigen, which bind to the CD4 receptor of T -helper cells; the LDL receptor 

15 binding region of the apolipoprotein E (ApoE) molecule; colony stimulating factor, 
or CSF, which binds to the CSF receptor; insulin-like growth factors, such as 
IGF-I and IGF-II, which bind to the IGF-I and IGF-II receptors, respectively; 
Interleukins 1 through 14, which bind to the Interleukin 1 through 14 receptors, 
respectively; the Fv antigen-binding domain of an immunoglobulin; gelatinase 

20 (MMP) inhibitor; bombesin, gastrin-releasing peptide; substance P; somatostatin; 
luteinizing hormone releasing hormone (LHRH); vasoactive peptide (VIP); gastrin; 
melanocyte stimulating hormone (MSH); cyclic RGD peptide and any other ligand 
or cell surface protein-binding (or targeting) molecule. 
E. Heterologous polynucleotides and Therapeutic Nucleic Acids 

25 The packaged adenoviral genome also can contain a heterologous 

polynucleotide that encodes a product of interest, such as a therapeutic protein. 
Adenoviral genomes containing heterologous polynucleotides are well known 
(see, e.g., U.S. Patent Nos. 5,998,205, 6,156,497, 5,935,935, and 
5,801,029). These can be used for in vitro and in vivo delivery of the products 

30 of heterlogous polynucleoties or the heterologous polynucleotides. 

Thus, the adenoviral particles provided herein can be used to engineer a 
cell to express a protein that it otherwise does not express or does not express 
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in sufficient quantities. This genetic engineering is accomplished by infecting the 
desired cell with an adenoviral particle whose genome includes a desired 
heterologous polynucleotide. The heterologous polynucleotide is then expressed 
in the genetically engineered cells. For use herein the cell is generally a 
5 mammalian cell, and is typically a primate cell, including a human cell. The cell 
can be inside the body of the animal (in vivo) or outside the body (in vitro). 
Heterologous polynucleotides (also referred to as heterologous nucleic acid 
sequences) are included in the adenoviral genome within the particle and are 
added to that genome by techniques known in the art. Any heterologous 

10 polynucleotide of interest can be added, such as those disclosed in U.S. Patent 
No. 5,998,205, incorporated herein by reference. Polynucleotides that are 

introduced into an Ad genome or vector can be any that encode a protein of 
interest or that are regulatory sequences. Proteins include, but are not limited 
to, therapeutic proteins, such as an immunostimulating protein, such as an 

15 interleukin, interferon, or colony stimulating factor, such as granulocyte 
macrophage colony stimulating factor (GM-CSF; see, e.g., 5,908,763F. 
Generally, such GM-CSF is a primate GM-CSF, including human GM-CSF. Other 
immunostimulatory genes include, but are not limited to, genes that encode 
cytokines IL1, IL2, IL4, IL5, IFN, IFN, TNF, IL12, IL18, and flt3), proteins that 

20 stimulate interactions with immune cells (B7, CD28, MHC class I, MHC class II, 
TAPs), tumor-associated antigens (immunogenic sequences from MART-1, 
gp100(pmel-17), tyrosinase, tyrosinase-related protein 1 , tyrosinase-related 
protein 2, melanocyte-stimulating hormone receptor, MAGE1, MAGE2, MAGE3, 
MAGE12, BAGE, GAGE, NY-ESO-1 , -catenin, MUM-1, CDK-4, caspase 8, KIA 

25 0205, HLA-A2R1701, -fetoprotein, telomerase catalytic protein, G-250, MUC-1, 
carcinoembryonic protein, p53, Her2/neu, triosephosphate isomerase, CDC-27, 
LDLR-FUT, telomerase reverse transcriptase, and PSMA), cDNAs of antibodies 
that block inhibitory signals (CTLA4 blockade), chemokines (MIP1 , MIPS, CCR7 
ligand, and calreticulin), and other proteins. 

30 Other polynucleotides, including therapeutic nucleic acids, such as 

therapeutic genes, of interest include, but are not limited to, anti-angiogenic, and 
suicide genes. Anti-angiogenic genes include, but are not limited to, genes that 



WO 2003/062400 



PCT/US2003/002295 



-48- 

encode METH-1 , METH -2, TrpRS fragments, proliferin-related protein, prolactin 
fragment, PEDF, vasostatin, various fragments of extracellular matrix proteins 
and growth factor/cytokine inhibitors. Various fragments of extracellular matrix 
proteins include, but are not limited to, angiostatin, endostatin, kininostatin, 
5 fibrinogen-E fragment, thrombospondin, tumstatin, canstatin, and restin. Growth 
factor/cytokine inhibitors include, but are not limited to, VEGF/VEGFR 
antagonist, sFlt-1, sFlk, sNRP1, angiopoietin/tie antagonist, sTie-2, chemokines 
(IP-10, PF-4, Gro-beta, IFN-gamma (Mig), IFN, FGF/FGFR antagonist (sFGFR), 
Ephrin/Eph antagonist (sEphB4 and sephrinB2), PDGF, TGF and IGF-1 . 

10 A "suicide gene" encodes a protein that can lead to cell death, as with 

expression of diphtheria toxin A, or the expression of the protein can render cells 
selectively sensitive to certain drugs, e.g., expression of the Herpes simplex 
thymidine kinase gene (HSV-TK) renders cells sensitive to antiviral compounds, 
such as acyclovir, gancyclovir and FIAU (1 -(2-deoxy-2-fluoro--.beta.- 

15 D-arabinofuranosil)-5-iodouracil). Other suicide genes include, but are not limited 
to, genes that encode carboxypeptidase G2 (CPG2), carboxylesterase (CA), 
cytosine deaminase (CD), cytochrome P450 (cyt-450), deoxycytidine kinase 
(dCK), nitroreductase (NR), purine nucleoside phosphorylase (PNP), thymidine 
phosphorylase (TP), varicella zoster virus thymidine kinase (VZV-TK), and 

20 xanthine-guanine phosphoribosyl transferase (XGPRT). Alternatively, a 

therapeutic nucleic acid can exert its effect at the level of RNA, for instance, by 
encoding an antisense message or ribozyme, a protein that affects splicing or 3' 
processing (e.g., polyadenylation), or a protein that affects the level of 
expression of another gene within the cell, e.g. by mediating an altered rate of 

25 mRNA accumulation, an alteration of mRNA transport, and/or a change in 

post-transcriptional regulation. The addition of a therapeutic nucleic acid to a 
virus results in a virus with an additional antitumor mechanism of action. Thus, 
a single entity (i.e., the virus carrying a therapeutic transgene) is capable of 
inducing multiple antitumor mechanisms. Other encoded proteins, include, but 

30 are not limited to, herpes simplex virus thymidine kinase (HSV-TK), which is 
useful as a safety switch (see, U.S. Patent Application No. 08/974,391, filed 
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November 19, 1997, which published as PCT Publication No. WO/9925860), 

Nos, FasL, and sFasR (soluble Fas receptor). 

Also contemplated are combinations of two or more transgenes with 

synergistic, complementary and/or nonoverlapping toxicities and methods of 
5 action. The resulting adenovirus can retain the viral oncolytic functions and, for 

example, additionally are endowed with the ability to induce immune and 

anti-angiogenic responses and other responses as desired. 

Therapeutic polynucleotides and heterologous polynucleotides also 

include those that exert an effect at the level of RNA or protein. These include 
10 include a factor capable of initiating apoptosis, RNA, such as RNAi and other 

double-stranded RNA, antisense and ribozymes, which among other capabilities 

can be directed to mRNAs encoding proteins essential for proliferation, such as 

structural proteins, transcription factors, polymerases, genes encoding cytotoxic 

proteins, genes that encode an engineered cytoplasmic variant of a nuclease 
15 (e.g. RNase A) or protease (e.g. trypsin, papain, proteinase K and carboxy- 

peptidase). Other polynucleotides include a cell or tissue specific promoters, 

such as those used in oncolytic adenoviruses (see, e.g., U.S. Patent No. 

5,998,205). 

The heterologous polynucleotide encoding a polypeptide also can contain 
20 a promoter operably linked to the coding region. Generally the promoter is a 
regulated promoter and transcription factor expression system, such as the 
published tetracycline-regulated systems, or other regulatable systems (WO 
01/30843), to allow regulated expression of the encoded polypeptide. 
Exemplary of other promoters, are tissue-selective promoters, such as those 
25 described in U.S. Patent No. 5,998,205. An exemplary regulatable promoter 
system is the Tet-On( and Tet-Off ( systems currently available from Clontech 
(Palo Alto, CA). This promoter system allows the regulated expression of the 
transgene controlled by tetracycline or tetracycline derivatives, such as 
doxycycline. This system can be used to control the expression of the encoded 
30 polypeptide in the viral particles and nucleic acids provided herein. Other 
regulatable promoter systems are known (see, e.g., published U.S. No. 
20020168714, entitled "Regulation of Gene Expression Using Single-Chain, 
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Monomeric, Ligand Dependent Polypeptide Switches," which describes gene 
switches that contain ligand binding domains and transcriptional regulating 
domains, such as those from hormone receptors). Other suitable promoters that 
can be employed include, but are not limited to, adenoviral promoters, such as 
5 the adenoviral major late promoter and/or the E3 promoter; or heterologous 
promoters, such as the cytomegalovirus (CMV) promoter; the Rous Sarcoma 
Virus (RSV) promoter; inducible promoters, such as the MMT promoter, the 
metallothionein promoter; heat shock promoters; the albumin promoter; and the 
ApoAl promoter. 

10 Therapeutic transgenes can be included in the viral constructs and 

resulting particles. Among these are those that result in an "armed" virus. For 
example, rather than delete E3 region as in some embodiments described herein, 
all or a part of the E3 region can be preserved or re-inserted in an oncolytic 
adenoviral vector (discussed above). The presence of all or a part of the E3 

15 region can decrease the immunogenicity of the adenoviral vector. It also 

increases cytopathic effect in tumor cells and decreases toxicity to normal cells. 
Typically such vector expresses more than half of the E3 proteins. 

Adenoviruses for therapy, including those for human therapy, are known. 
Such known viruses can be modified as provided herein to reduce or eliminate 

20 interaction with HSPs and optionally additional receptors. The adenoviral vectors 
that are used to produce the viral particles can include other modifications. 
Modifications include modifications to the adenovirus genome that is packaged 
in the particle in order to make an adenoviral vector. As discussed above, 
adenovirus vectors and particles with a variety of modifications are available. 

25 Modifications to adenvoiral vectors include deletions known in the art, such as 
deletions in one or more of the El, E2a, E2b, E3, or E4 coding regions. These 
adenoviruses are sometimes referred to as early generation adenoviruses, include 
those with deletions of all of the coding regions of the adenoviral genome 
("gutless" adenoviruses, discussed above) and also include replication-condi- 

30 tional adenoviruses, which are viruses that replicate in certain types of cells or 

tissues but not in other types as a result of placing adenoviral genes essential for 
replication under control of a heterologous promoter (discussed above; see, also 
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U.S. Patent No. 5,998,205, U.S. Patent No. 5,801,029; U.S. patent application 
60/348,670 and corresponding published international PCT application No. 
WO02/06786). These include the cytolytic, cytopathic viruses (or vectors), 
including the oncolytic viruses discussed above. 
5 Alternatively, as discussed above, the vector can include a mutation or 

deletion in the E1b gene. Typically such mutation or deletion in the E1b gene is 
such that the E1b-19kD protein becomes non-functional. This modification of 
the E1b region can be combined with vectors where all or a part of the E3 region 
is present. 

10 The oncolytic adenoviral vector can further include at least one 

heterologous coding sequence, such as one that encodes a therapeutic product. 
The heterologous coding sequence, such as therapeutic gene, is generally, 
although not necessarily, in the form of cDNA, and can be inserted at any locus 
that does not adversely affect the infectivity or replication of the vector. For 

15 example, it can be inserted in an E3 region in place of at least one of the 

polynucleotide sequences that encode an E3 protein, such as, for example, the 
19kD or 14.7 kD E3 gene. 
F. Propagation and Scale-up 

Since doubly ablated adenoviral vectors containing mutations in the fiber 

20 and/or penton capsid proteins result in inefficient cell binding and entry via the 
CAR/crv integrin entry pathway, scaled up technologies improve the growth and 
propagation of such vectors to produce high titers of the adenoviral vectors for 
clinical use. Thus, also provided is a method for scaling up the production of 
detargeted adenoviral vectors. The detargeted adenoviral vectors comprise an 

25 adenoviral vector modified to ablate the interaction of said vector with at least 
one host cell receptor compared with a wild-type adenoviral vector. The 
detargeted adenoviral vectors can comprise an adenoviral vector modified to 
ablate the interaction of said vector with one, two, three or more host cell 
receptors. Thus, the method is suitable for producing the detargeted adenoviral 

30 vectors disclosed herein. 

As noted, growth and propagation of doubly and fully ablated adenoviral 
vectors is enhanced by new scale up technologies. Doubly ablated vectors 
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contain mutations in the fiber and penton capsid proteins that result in inefficient 
cell binding and entry via the normal cellular entry pathway using CAR and 
integrins. These vectors are fully detargeted in vitro and, thus, alternative 
cellular entry strategies allow for the efficient growth and generation of high titer 
5 preparations. 

Two strategies have been envisioned to scale up vectors that are 
detargeted via fiber and/or penton modifications. These include: (a) the use of 
pseudoreceptor cell lines engineered to express a surface receptor that binds a 
ligand displayed on the vector (see, e.g., International PCT application No. WO 

10 98/54346) and (b) complementing cell lines that are engineered to express 

native fiber and that can be engineered to express native fiber and penton (see, 
e.g., International PCT application No. WO 00/42208). Although these systems 
have shown promise for scaling up ablated adenoviral vectors, there is a need to 
develop a system for the simple, efficient production of the fully detargeted 

15 adenoviral vector for therapeutic uses. 

Provided herein is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium, bind adenoviral particles and direct 
their entry into the producer cells. 

20 Reagents (also called medium additives) also can be included in the tissue 

culture medium containing producer cells to be infected with the detargeted 
adenoviral vectors. Alternatively the reagents can be pre-mixed with the virus, 
which mixture is then added to the tissue producer cells. The reagents can be 
added to tissue culture medium containing producer cells, or producer cells can 

25 be added to tissue culture medium containing the reagents. Any suitable 

producer cell known to the skilled artisan can be used in the present methods. 
The reagents can be added at the same time that the producer cells are infected 
with detargeted adenoviral vectors. Generally the reagents are present in the 
tissue culture medium prior to infection by the detargeted adenoviral vectors. 

30 The medium additives are maintained in the tissue culture medium during vector 
growth, spread and propagation. High titer yields of adenoviral vectors are 
obtained by this method. 
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Reagents which are useful in this method are those that are capable of 
directing adenoviral particle entry into the producer cells. Such reagents include, 
but are not limited to, polycations and bifunctional reagents. Suitable 
polycations include, but are not limited to, polytheylenimine; protamine sulfate; 
5 poly-L-lysine hydrobromide; poly(dimethyl diallyl ammonium) chloride 

(Merquat(r)-IOO, Merquat(r)280, Merquat(r)550); poly-L-arginine hydrochloride; 
poly-L-histidine; poly(4-vinylpyridine), poly(4-vinylpyridine) hydrochloride; 
poly(4-vinylpyridine)cross-Iinked, methylchloride quaternary salt; poly(4-vinyl- 
■ pyridine-co-styrene); poly(4-vinylpyridinium poly(hydrogen fluoride)); poly(4-vinyl- 

10 pyridinium-P-toluene sulfonate); poly(4-vinylpyridinium-tribromide); poly(4-vinyl- 
pyrrolidone-co-2-dimethylamino-ethyl methacrylate); polyvinylpyrrolidone, 
cross-linked; poly vinylpyrrolidone, poly(melamine-co-formaldehyde); partially 
methylated; hexadimethrine bromide; poly(Glu, Lys) 1 :4 hydrobromide; poly(Lys, 
Ala) 3:1 hydrobromide; poly(Lys, Ala) 2:1 hydrobromide; poly-L-lysine 

15 succinylated; poly(Lys, Ala) 1:1 hydrobromide; and poly(Lys, Trp) 1:4 
hydrobromide. 

Suitable bifunctional reagents include, but are not limited to, antibodies or 
peptides that bind to the adenoviral capsid and that also contain a ligand that 
allows interaction with specific cell surface receptors of the producer cells. 
20 Examples of bifunctional reagents include: (a) anti-fiber antibody ligand fusions, 
(b) anti-fiber-Fab-FGF conjugate, (c) anti-penton-antibody ligand fusions, (d) 
anti-hexon antibody ligand fusions and (e) polylysine-peptide fusions. The ligand 
is any ligand that will bind to any cell surface receptor found on the producer 
cells. 

25 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
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EXAMPLE 1 

Construction of Ad5 Vectors Containing the Fiber AB Loop, KOI and Penton, 
PD1 Mutations and Derivatives Thereof 

Three recombinant adenoviral vectors were prepared that contain the KOI 

5 fiber or PD1 penton base mutations either alone or in combination, these vectors 

are designated Av3nBgFK01 Av1nBgPD1, and Av1 nBgFKOI PD1 . Construction 

of these vectors is described below and a general description of each vector is 

set forth in Table 1 . 

TABLE 1 

0 Description Of Detargeted 

Recombinant Adenoviral Vectors Used For Scale-up 

Vector 



Vector 


Description 


Av3nBg 


An E1, E2a, E3-deieted adenoviral vector encoding a nuclear 
localizing /?-galactosidase 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing /?-galactosidase 


Av3nBgFK01 


The same as Av3nBg but containing the KOI mutation in the 
fiber gene 


Av1 nBgPDl 


The same as AvInBg but containing the PD1 mutation in the 
penton gene 


Av1 nBgFKOI PD1 


The same as AvInBg but containing the fiber KOI and penton 
PD1 mutations 



20 AvInBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 43. 
Av3nBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 44. 
Av3nBgFK01 

25 Genetic incorporation of the KOI fiber mutation to generate Av3nBgFK01 

The adenoviral vector AvSnBgFKOI was generated in an E1-, E2a-, 
E3-deleted backbone based on the adenovirus serotype 5 genome. It contains a 
RSV promoted nuclear-localizing jS-galactosidase gene in place of the E1 region. 
In addition, the fiber gene carries the KOI mutation. This mutation results in a 
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substitution of fiber amino acids 408 and 409, changing them from serine and 
proline to glutamic acid and alanine, respectively. 

The vector was constructed as follows. First, the plasmid pSKOl (Figure 
1) was digested with the restriction enzymes Sphl and Muni. The resulting DNA 
5 fragments were separated by electrophoresis on an agarose gel. The 1601 bp 
fragment containing all but the 5' end of the fiber gene was excised from the 
agarose gel and the DNA was isolated and purified. The fragment was then 
ligated with the 9236 bp fragment of p5FloxHRFRGD, which had been digested 
with Sphl and Muni. The resulting plasmid, pSFIoxHRFKOI , was digested with 
0 { Spel and Pad and the 6867 bp fragment containing the fiber gene was isolated. 
The fragment was ligated with the 24,630 bp Spel-Pacl fragment of pNDSQ3.1. 
The resulting plasmid, pNDSQ3.1K01 (Figure 2), was used together with 
pAdmireRSVnBg (Figure 3A) to generate a plasmid which encodes the full-length 
adenoviral vector genome. It, however, was necessary to remove the Pad site 
5 from pNDSQ3.1K01 (Figure 2) prior to recombination with pAdmireRSVnBg 
(Figure 3A) so that the final plasmid contains a unique Pad site adjacent to the 
5' ITR. The Pad site in pNDSQ3.1K01 was removed by digestion with Pad 
followed by blunting with T4 DNA Polymerase and religation. The resulting 
plasmid was called pNDSQ3.1 KOI (Pac. 

20 To generate a full-length plasmid containing the entire adenoviral genome, 

pAdmireRSVnBg (Figure 3A) was digested with Sail and co-transfected into 
competent cells of the E. coli strain BJ5183 along with pNDSQ3.1 KOI APac, 
which had been digested with BstBI. Homologous recombination between the 
two plasmids generated a full-length plasmid encoding the entire adenoviral 

25 vector genome, which was called pFLAv3nBgFK01 . 

The plasmid pFLAv3nBgK01 was linearized with Pad and transfected into 
633 cells. In the fiber complementing 633 cell line, the resulting viral DNA 
containing the KOI mutation is capable of being packaged into infectious viral 
particles containing a mixture of wildtype fiber and mutant fiber proteins. After 

30 five rounds of amplification in 633 cells, a cytopathic effect was observed. 
Three more rounds of amplification in 633 cells were performed followed by 
purification of the virus by standard CsCI centrifugation procedures. This viral 
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preparation was used to infect AE1-2a cells, which do not express fiber. The 
resulting virus contained only the mutant fiber protein on its capsid. Virus 
particles were purified by standard CsCI centrifugation procedures. 
AvInBgFKOI 

5 The v ector AvInBgFKOI is made in a similar manner to AvSnBgFKOI 

described above. 
Av1nBgK012 

An additional fiber AB loop mutation (described by Einfeld et at. (2001) J. 
Virology 75:1 1 284-1 1 291 ) was incorporated into the genome of Av1 nBg. This 

10 AB loop mutation is a four amino acid substitution, R512S, A515G, E516G, and 
K517G, and is referred to as K012. The K012 mutation was incorporated into 
the fiber gene by PGR gene overlap extension using the plasmid pSQ1 (Figure 
3B) as template. The pSQ1 plasmid contains most of the Ad5 genome, 
extending from base pair 3329 through the right ITR, in a pBR322 backbone. 

15 First, a segment of the Ad5 genome extending from within the E3 region into the 
fiber gene was amplified by PGR using the plasmid pSQ1 as a template with the 
following primers termed 5FF, 5'-GAA CAG GAG GTG AGC TTA GA-3' SEQ ID 
No. 4), and 5FR, 5'-TCC GCC TCC ATT TAG TGA ACA GTT AGG AGA TGG 
AGC TGG TGT G-3' (SEQ ID No. 6). The primer 5FR contains an 18 base 

20 5'-extension that encodes the modified fiber AB loop amino acids from 512 
through 517. A second PCR using pSQ1 as a template amplified the region 
immediately 3' of the AB loop substitution and extending past the Muni site 
located 40 base pairs 3' of the fiber gene stop codon. The two primers used for 
this reaction were 3FF: 5'-TCA CTA AAT GGA GGC GGA GAT GCT AAA CTC 

25 ACT TTG GTC TTA AC-3' (SEQ ID No. 7), and 3FR: 5'-GTG GCA GGT TGA 
ATA CTA GG-3' (SEQ ID No. 8). The primer 3FR contains an 18 base 
5'-extension that encodes the modified fiber AB loop amino acids 512 through 
517. Amplified products of the expected size were obtained and used in a 
second PCR with the end primers 5FF and 3FR to join the fragments together. 

30 The KQ12 PCR fragment was digested with Xbal and Muni cloned directly into 
the fiber shuttle plasmid, pFBshuttle(EcoRI) to generate the plasmid pFBSEK012 
which contains the 8.8kB EcoRI fragment of pSQ1 . The pFBSEKQ12 plasmid 
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was digested with Xbal and EcoRI and cloned into pSQ1 using a three-way 
ligation to generate pSQ1K012 (Figure 3C). The K012 cDNA was incorporated 
into the genome of AvInBg, an adenovirus vector with E1 and E3 deleted 
encoding /?-galactosidase, by homologous recombination between Clal-linearized 
5 pSQ1K012 and pAdmireRSVnBg digested with Sail and Pad to generate 
Av1nBgK012. The K012 vector was transfected in 633 cells, scaled-up on 
non-fiber expressing cells and purified, as described above for KOI. 
AvInBgPDI 

Genetic incorporation of the PD1 penton mutation to generate 
10 AvInBgPDI 

The adenoviral vector AvInBgPDI is an E1-, E3-deleted vector based on 

the adenovirus serotype 5 genome. It contains a RSV promoted 

nuclear-localizing yff-galactosidase gene in the E1 region and also contains the 

PD1 mutation in the penton gene. The PD1 mutation results in a substitution of 

15 amino acids 337 through 344 of the penton protein, HAIRGDTF (SEQ ID No. 9), 
with amino acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thus replacing the RGD 
tripeptide (see, Einfeld et aL (2001 ) J. Virology 75:11284-11291). The 
mutation in the penton gene was generated in the plasmid pGEMpen5, which 
contains the Adenovirus serotype 5 penton gene. To generate the mutation, 

20 four oligonucleotides were synthesized. The sequences of the oligonucleotides 
were as follows: penton 1 : 5' CGC GGA AGA GAA CTC CAA CGC GGC AGC 
CGC GGC AAT GCA GCC GGT GGA GGA CAT GAA 3' (SEQ ID No. 11); penton 
2: 5' TAT CGT TCA TGT CCT CCA CCG GCT GCA TTG CCG CGG CTG CCG 
CGT TGG AGT TCT CTT CC 3' (SEQ ID No. 12); penton 3: 5' CGA TAG CCG 

25 CGG CTA CCC CTA CGA CGT GCC CGA CTA CGC GGG CAC CAG CGC CAC 

ACG GGC TGA GGA GAA GCG CGC 3' (SEQ ID No. 13); penton 4: 5' TCA GCG 
CGC TTC TCC TCA GCC CGT GTG GCG CTG GTG CCC GCG TAG TCG GGC 
ACG TCG TAG GGG TAG CCG CGG C 3' (SEQ ID No. 14). The complementary 
oligonucleotides penton 1 and penton 2 were annealed to each other as were 

30 penton 3 and penton 4. The duplex generated by annealing penton 3 and penton 
4 encoded the substitution of amino acids 337 through 344 described above. 
The duplex generated by annealing penton 1 and penton 2 possessed a 5 base 
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5' overhang which was compatible to a 5 base 5' overhang on the duplex 
generated by annealing penton 3 and penton 4. The opposite end of the duplex 
generated by annealing penton 1 and penton 2 contained an Earl compatible 
overhang. The opposite end of the duplex generated by annealing penton 3 and 
5 penton 4 contained a BbvCI compatible overhang. The two duplexes were 

ligated to each other and ligated back into the pGEMpen5 backbone as follows. 
First, pGEMpenB was digested with BbvCI and Pstl and the resulting DNA 
fragments were separated by electrophoresis on an agarose gel. The 3360 bp 
fragment was excised from the gel and purified. The plasmid pGEMpenS was 
10 also digested with Pstl and Earl and the resulting fragments were separated by 
electrophoresis on an agarose gel. The 955 bp fragment was excised from the 
gel and purified. These two fragments from the pGEMpen5 plasmid were ligated 
with the two pairs of annealed oligonucleotides to generate the plasmid 
pGEMpen5PD1 . 

15 The mutated penton gene was transferred from pGEMpen5PD1 to pSQ1 

using a 5-way ligation as follows. First, the region of the penton gene 
containing the PD1 mutation was excised from pGEMpen5PD1 by digestion with 
Pvul and Ascl. The 974 bp fragment containing the PD1 mutation was purified. 
x Four DNA fragments were prepared from the pSQ1 plasmid (Figure 3B) as 

20 follows. The plasmid was digested with Csp45l and Fsel and the 9465 bp 

fragment was purified. In addition pSQ1 was digested with Fsel and Pvul and 
the 2126 bp fragment was purified. The plasmid pSQl was digested with Ascl 
and BamHI and the 5891 bp fragment was purified. Finally, pSQ1 was digested 
with BamHI and Csp45I and the 14610 bp fragment was purified. The 5 purified 

25 DNA fragments were ligated to each other to form the plasmid pSQ1PD1 (Figure 
4). 

To generate adenoviral vector, pSQ1PD1 was linearized by digestion with 
Clal and co-transfected into PerC6 cells with pAdmireRSVnBg (Figure 3A) which 
had been digested with Sail and Pad. hexadimethrine bromide was maintained 
30 in the medium at 4 /yg/ml. When a cytopathic effect was observed, a crude viral 
lysate was further expanded on PerC6 cells. The virus was purified by standard 
CsCI centrifugation procedures. 
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Av1 nBgFK0 1 PD 1 

Genetic incorporation of the fiber KOI or K012 mutation in combination 
with the penton PD1 mutation to generate Av1 nBgFK0 1 PD 1 

The adenoviral vectors Av1 nBgFKOI PD1 and Av1 nBgKOI 2PD1 were 

5 generated in an E1-, E3-deleted adenovirus serotype 5 genome. Both vectors 

contains a RSV promoted nuclear-localizing ^-galactosidase gene in the E1 region 

and also contains either the KOI or K01 2 mutation in the fiber gene as well as 

the PD1 mutation in the penton gene. The vectors were constructed as follows. 

First, the plasmid pSQ1PD1 was digested with Csp45l and Spel and the 23976 

0 bp fragment containing the PD1 mutated penton gene was purified. In addition, 

the plasmids pSQIKOI or pSQ1K012 (Figure 3B) were digested with Csp45l 

and Spel and the 9090 bp fragment containing the KOI or K012 mutated fiber 

gene were purified. The appropriate purified fragments were ligated to each 

other to from the plasmid pSQ1FK01PD1 (Figure 5A) or pSQ1K012PD1 (Figure 

5 5B) that contains the K01 (or K012) mutated fiber gene and the PD1 mutated 

penton gene. To generate virus, pSQ1FK01PD1 or pSQK012PD1 was linearized 

with Clal and co-transfected into 633 cells with pAdmireRSVnBg (Figure 3A) 

which had been digested with Sail and Pacl. After three rounds of amplification 

in 633 cells a cytopathic effect was observed and the crude viral lysate was 

20 then amplified on PerC6 cells. Hexadimethrine bromide was maintained in the 

medium at 4 ^/g/ml. Each virus was purified by standard CsCI centrifugation 

procedures. 

EXAMPLE 2 

In Vitro Evaluation of Adenoviral Vectors Containing the KOI and PD1 Mutations 

25 Several recombinant adenoviral vectors were used in these studies to 

demonstrate the function of the KOI fiber mutation and included AvInBg, 
Av1 nBgFKOI, Av1nBgPD1, and Av1 nBgFKOI PD1 , described above. The 
transduction efficiencies of adenoviral vectors containing the KOI and/or PD1 
mutations were evaluated on cells of the alveolar epithelial cell line A549. The 

30 transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber and penton. 



WO 2003/062400 



PCT/US2003/002295 



-60- 

The day prior to infection, cells were seeded into 24-well plates at a 
density of approximately 1 x 10 5 cells per well. Immediately prior to infection, 
the exact number of cells per well was determined by counting a representative 
well of cells. Each of the vectors, AvInBg, AvInBgFKOI, and Av1 nBgFKOI PD1 
5 were used to transduce A549 cells at each of the following particle per cell 
(PPC) ratios: 100, 500, 1000, 2500, 5000, 10,000. The cell monolayers were 
stained with X-gal 24 hours after infection and the percentage of cells 
expressing jff-galactosidase was determined by microscopic observation and 
counting of cells. Transductions were done in triplicate and three random fields 
10 in each well were counted, for a total of nine fields per vector. 

The results at the 500 PPC ratio are shown in Figure 6 and show a 
significantly reduced transduction efficiency on A549 cells using vectors 
containing the KOI mutation alone or when combined with PD1 compared to 
AvInBg. The vectors containing the PD1 mutation alone had no effect on 
15 adenoviral transduction of A549 cells in vitro. 

EXAMPLE 3 

In Vivo Analysis of Adenoviral Vectors Containing the FKOI and PD1 Mutations 

This Example provides experiments that evaluate the in vivo 
biodistribution of adenoviral vectors containing the KOI and PD1 mutations and 

20 their influence on adenoviral-mediated liver transduction. The results show that 
ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

A positive control cohort received AvInBg and a negative control group 
received HBSS. Additionally, the Av1 nBgFKOI 2 and Av1 nBgFKOI 2PD1 vectors 

25 were analyzed in vivo. These vectors each contain a fiber protein with the four 
amino acid substitution in the AB loop. Additionally, Av1 nBgFKOI 2PD1 
contains a mutation in the penton base. Both of these mutations were known 
(see, Einfeld et af. (2001 ) J. Virology 75'A 1 284-1 1 291 ), and were alleged to 
decrease liver transduction 10 to 700 fold, respectively. Cohorts of five 

30 C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 13 

particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 
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kidney were collected from each animal. The median lobe of the liver was placed 
in neutral buffered formalin to preserve the sample for /?-galactosidase 
immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon PCR analysis to determine vector content. A separate 
5 sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. 

For /?-galactosidase immunohistochemistry slices of liver, approximately 
2-3 mm thick, were placed in 10% neutral buffered formalin. After fixation, 
these samples were embedded in paraffin, sectioned, and analyzed by 

10 immunohistochemistry for yff-galactosidase expression. A 1:1200 dilution was 
used of a rabbit anti-£-galactosidase antibody (ICN Pharmaceuticals, Inc.; Costa 
Mesa, CA) in conjunction with a Vectastain ABC kit (Vector Laboratories, Inc., 
Burlingame, CA) to visualize positive cells. 

The chemiluminescent jff-galactosidase activity assay was performed 

15 using the Galacto-Light PlusTM chemiluminescent assay (Tropix, Inc., Foster 
City, CA) system. Tissue samples were collected in lysis matrix tubes 
containing two ceramic spheres (Bio101, Carlsbad, CA) and frozen on dry ice. 
The tissues were thawed and 500 jj\ of lysis buffer from the Galacto-Light Plus 
kit was added to each tube. The tissue was homogenized for 30 seconds using a 

20 FastPrep System (Bio101, Carlsbad, CA). Liver samples were homogenized for 
an additional 30 seconds, jff-galactosidase activity was determined in the liver 
homogenates according to the manufacture's protocol. 

For hexon PCR analysis DNA from tissues was isolated using the Qiagen 
Blood and Cell Culture DNA Midi or Mini Kits (Qiagen Inc., Chatsworth, CA). 

25 Frozen tissues were partially thawed and minced using sterile disposable 

scalpels. Tissues were then lysed by incubation overnight at 55° C in Qiagen 
buffer G2 containing 0.2 mg/ml RNaseA and 0.1 mg/ml protease. Lysates were 
vortexed briefly and then applied to Qiagen-tip 100 or Qiagen-tip 25 columns. 
Columns were washed and DNAs were eluted as described in the manufacturer's 

30 instructions. After precipitation, DNAs were dissolved in water and the 

concentrations were spectrophotometrically determined (A260 and A280) on a 
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DU-600 (Beckman Coulter, Inc.; Fullerton, CA) or a SPECTRAmax PLUS 
(Molecular Devices, Inc.; Sunnyvale, CA) spectrophotometer. 2.3.2. 

PCR primers and a Taqman probe specific to adenovirus hexon sequences 
were designed using Primer Express software v. 1 .0 (Applied Biosystems, Foster 
5 City, CA). Primer and probe sequences were: 

Hexon Forward primer: 5'-CTTCGATGATGCCGCAGTG-3' (SEQ ID No. 38); 
Hexon Reverse primer: 5'-GGGCTCAGGTACTCCGAGG-3' (SEQ ID No. 39); and 
Hexon Probe: 5'-FAM-TTACATGCACATCTCGGGCCAGGAC-TAMRA-3' (SEQ ID No. 
40). 

10 Amplification was performed in a reaction volume of 50 fj\ under the 

following conditions: 10 ng (tumor) or 1 //g (liver and lung) of sample DNA, 1X 
Taqman Universal PCR Master Mix (Applied Biosystems), 600 nM forward 
primer, 900 nM reverse primer and 100 nM hexon probe. Thermal cycling 
conditions were: 2 minute incubation at 50° C, 10 minutes at 95° C, followed by 

15 35 cycles of successive incubation at 95° C for 1 5 seconds and 60° C for 1 
minute. Data was collected and analyzed using the 7700 Sequence Detection 
System software v. 1.6.3 (Applied Biosystems). Quantification of adenovirus 
copy number was performed using a standard curve that includes dilutions of 
adenovirus DNA from 1,500,000 copies to 15 copies in the appropriate 

20 background of cellular genomic DNA. For analysis of tumor tissues, a standard 
curve in a background of 10 ng human DNA was generated. For analysis of 
mouse liver and lung tissues, a standard curve using the same adenovirus DNA 
dilutions in a background of 1 jjg CD-1 mouse genomic DNA was generated. 
Samples were amplified in triplicate, and the average number of total copies was 

25 normalized to copies per cell based on the input DNA weight amount and a 
genome size of 6 x 10 9 bp. 

The results of the jff-galactosidase activity assay and adenoviral hexon 
DNA content for liver transduction by these vectors are shown in Figure 7A and 
7B. The vector containing the KOI or K012 mutations alone showed, on 

30 average, a slight increase in liver transduction compared to AvInBg, which is 
consistent with several previous experiments. The vectors containing the PD1 
mutation alone or combined with KOI or KQ12 showed a slight decrease in liver 
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transduction compared to AvInBg, suggesting that integrins are involved to 
some extent in hepatic uptake of the adenoviral vectors. 

The results of the immunohistochemical staining of liver sections for 
£~galactosidase were consistent with the activity assays (data not shown) and 
5 demonstrate that gene expression was localized specifically to hepatocytes. The 
vectors containing the KOI or K012 mutation alone showed a slight increase in 
liver transduction as revealed by a more intense and frequent 
immunohistochemical-staining pattern. The vectors containing the PD1 
mutation, either alone or combined with KOI or K012, showed little difference 

10 in transduction compared to AvInBg. These results demonstrate that 

ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

In summary, the fiber AB loop mutation contained in AvInBgFKOI or 
Av1nBgK012 ablates interaction with human and mouse CAR in vitro and 

15 diminished transduction in vitro. In vivo, however, fiber AB loop mutations 
behaved unexpectantly, because such mutations were found to enhance 
adenoviral-mediated gene transfer to liver and results in increasing vector 
potency. The penton base, PD1 mutation that ablates interaction with the 
second receptor involved in adenoviral internalization had no effect in vitro and 

20 little to no effect in vivo. These studies indicated that other receptors are 
responsible for adenoviral gene transfer to the liver in vivo. 

EXAMPLE 4 

Description Of Adenoviral Vectors Containing A Fiber With Amino Acid 
Substitutions At The Heparin Sulfate Binding Domain In The Fiber Shaft 

25 Vectors containing substitutions at all four of the amino acids in the four 

amino acid motif in the Ad5 fiber shaft (residues 91 to 94, KKTK; SEQ ID No. 1) 
were generated in order to ablate the potential interaction with HSP. The 
mutation is termed HSP because it potentially eliminates binding to heparan 
sulfate proteoglycans. Vectors containing the HSP mutation alone and combined 

30 with the KOI mutation (fiber knob AB loop mutation that ablates CAR binding), 
the PD1 mutation (penton mutation that eliminates RGD/integrin interaction), and 
a triple knockout vector (HSP, KOI, PD1) were generated. 
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Generation of the HSP fiber mutation: The HSP mutation was 
incorporated into the fiber gene by using a PCR-based strategy of gene splicing 
by overlap extension (PCR SOEing). First, a segment of the Ad5 genome 
extending from within the E3 region into the 5' end of the fiber gene was 
5 amplified by PCR using the plasmid pSQ1 (Figure 3B) as a template and two 
primers termed 5FF and 5HSPR. The DNA sequence of 5FF is as follows: 5' 
GAA CAG GAG GTG AGC TTA GA 3' (SEQ ID No. 5). This sequence 
corresponds to base pairs 25,199 - 25,218 of pSQ1 . The DNA sequence of 
5HSPR is as follows: 5' GGC TCC GGC TCC GAG AGG TGG GCT CAC AGT 

10 GGT TAG ATT T 3' (SEQ ID No. 1 5). 5HSPR is a reverse primer for 5FF and 
corresponds to a region in the fiber shaft adjacent to the KKTK (SEQ ID No. 1) 
region. The primer contains a 5' extension that encodes a GAGA substitution 
for the native KKTK (encoded by SEQ ID No. 1) amino acid sequence. A second 
PCR using pSQ1 as a template amplified the region immediately 3' of the KKTK 

15 (SEQ ID No. 1) site and extending past the Muni site located 40 base pairs 3' of 
the stop codon for the fiber gene. The two primers used for this reaction were 
3HSPF and 3FR. The DNA sequence of 3HSPF is as follows: 5' GGA GCC GGA 
GCC TCA AAC ATA AAC CTG GAA AT 3' (SEQ ID No. 1 6). It contains a 5' 
extension that is complementary to the 5' extension of 5HSPR. The DNA 

20 sequence of 3FR is as follows: 5' GTG GCA GGT TGA ATA CTA GG 3' (SEQ ID 
No. 8). 

The two PCR products were joined by PCR SOEing using primers 5FF and 
3FR. The resulting PCR product was digested with the restriction enzymes Xbal 
and Muni. The 2355 bp fragment was gel purified and ligated with the 6477 bp 

25 Xbal to Muni fragment of the plasmid pFBshuttle(EcoRI) (Figure 8) to generate 
the plasmid pFBSEHSP. The plasmid pFBshuttle(EcoRI) was generated by 
digesting the plasmid pSQ1 with EcoRI, then gel purifying and self-ligating the 
8.8 kb fragment containing the fiber gene. Next, the fiber gene containing the 
HSP mutation was transferred from pFBSEHSP into pSQ1 using a three-way 

30 ligation. The 16,431 bp EcoRI to Ndel fragment of pSQ1 , the 9043 bp Ndel to 
Xbal fragment of pSQ1 , and the 7571 bp Xbal to EcoRI fragment of pFBSEHSP 
were isolated and ligated to generate pSQIHSP (Figure 9). 
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To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene, pSQ1 HSP was digested with Clal and pAdmireRSVnBg (Figure 
3A) was digested with Sail and Pad, then the two digested plasmids were 
co-transfected into 633 cells (von Seggern eta/. (2000) J Virology 74:354-362). 
5 Homologous recombination between the two plasmids generated a full-length 
adenoviral genome capable of replication in 633 cells, which inducibly express 
Ad5 E1A and constitutively express wild-type fiber protein. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation, 

10 the viral preparation was used to infect PerC6 cells, which do not express fiber. 
The resulting virus, termed AvInBgFS*, was purified by standard CsCI 
centrifugation procedures. 

Generation of vector containing the HSP and KOI mutations 

To generate an adenoviral vector containing the HSP and K01 mutations 

15 in fiber, a PCR SOEing strategy identical to the one described above was used 
except that the plasmid pSQ1FK01 was used as the template. The PCR SOEing 
product was digested with Xbal and Muni and ligated with the 6477 bp Xbal to 
Muni fragment of pFBshuttle(EcoRI) to generate pFBSEHSPKOI . The fiber gene 
containing the HSP and KOI mutations was transferred from pFBSEHSPKOI into 

20 the pSQ1 backbone using a three-way ligation strategy identical to the one 
described above for the HSP mutation alone, to generate the plasmid 
pSQIHSPKOI (Figure 10). Recombinant adenoviral vector containing the HSP 
and KOI mutations in the fiber gene was generated by co-transfecting 
pSQIHSPKOI digested with Clal and pAdmireRSVnBg digested with Sail and 

25 Pad into 633 cells. Adenovirus was propagated and purified as described above 
for the vector containing the HSP mutation alone. The resulting virus was 
termed Av1 nBgFKOI S*. 
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Generation of vector containing the HSP and PD1 mutations 

The following strategy was used to generate a recombinant adenoviral 
vector containing the fiber HSP mutation and the penton PD1 mutation. The 
plasmid pSQ1PD1 (Figure 4) was digested with the restriction enzymes Csp45l 
5 and Spel and the 23,976 bp fragment was isolated and purified. In addition, the 
plasmid pSQIHSP was also digested with Csp45l and Spel and the 9090 bp 
fragment was isolated and purified and ligated to the 23,976 bp fragment to 
generate the plasmid pSQ1HSPPD1 (Figure 1 1), which contains the fiber HSP 
and penton PD1 mutations. An adenoviral vector was generated, propagated, 

10 and purified as described above. The resulting virus was termed Av1nBgS*PD1 . 
Generation of vector containing the I4SP, KOI, and PD1 mutations 
To generate an adenoviral vector containing the HSP, KOI, and PD1 
mutations the following strategy was used. First, the plasmid pSQ1PD1 was 
digested with Csp45l and Spel and the 23,976 bp fragment was isolated and 

15 purified. In addition, the plasmid pSQIHSPKOI was digested with Csp45l and 
Spel and the 9090 bp fragment was isolated and purified. The two DNA 
fragments were ligated to form the plasmid pSQ1 HSPKOI PD1 (Figure 12). 
Recombinant adenoviral vector was generated, propagated, and purified as 
described above. The resulting virus was termed Av1 nBgFKOI S*PD1 . 

20 EXAMPLE 5 

In Vitro Evaluation Of Adenoviral Vectors Containing The HSP Fiber Mutation 

The transduction efficiencies of adenoviral vectors containing the HSP 
mutation in the fiber gene, either alone or combined with the KOI and/or PD1 
mutations, were evaluated on A549 and HeLa cells. The transduction 

25 efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber and penton. The day prior to infection, cells were seeded into 
24-well plates at a density of approximately 1 x 10 5 cells per well. Immediately 
prior to infection, the exact number of cells per well was determined by counting 
a representative well of cells. Each of the vectors, AvInBg (see, Stevenson et 

30 a/. (1 997) J. Virol. 77:4782-4790), Av1 nBgS*, Av1 nBgFKOI S\ 

Av1 nBgS*PD1 , and Av1 nBgFKOI S*PD1 , were used to transduce A549 cells at 
each of the following particle per cell (PPC) ratios: 100, 500, 1000, 2500, 
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5000, 10,000. HeLa cells were transduced with each of the above vectors, as 
well as a vector containing the KOI mutation alone (AvInBgFKOD and a vector 
containing the PD1 mutation alone (Av1nBgPD1) at 2000 PPC. The cell 
monolayers were stained with X-gal 24 hours after infection and the percentage 
5 of cells expressing jg-galactosidase was determined by microscopic observation 
and counting of cells. Transductions were done in triplicate and three random 
fields in each well were counted, for a total of nine fields per vector. 

The results (depicted in Figures 13A-13B) showed significantly reduced 
transduction efficiencies on A549 and HeLa cells using vectors containing the 
10 HSP mutation compared to AvInBg. The vectors containing the HSP mutations, 
however, demonstrated a dose response on A549 cells, in that increasing PPC 
ratios yielded increasing transduction. 

Competition experiments were done to determine which receptor molecular 
interactions are involved in transduction of A549 cells by the various vectors. 

15 Transductions were performed in the presence or absence of various competitors 
including Ad5 fiber knob, a 50 amino acid oligopeptide derived from Adenovirus 
serotype 2 penton base which spans the RGD tripeptide region, or heparin 
(Invitrogen Life Technologies, Gaithersburg, MD). Monolayers of A549 cells 
were cultured in Richters medium supplemented with 10% FBS and were 

20 transduced with AvInBg, AvInBgS*, Av1 nBgFKOI S\ Av1nBgS*PD1, or 

Av1 nBgFKOI S*PD1 in infection medium (IM, Richters medium plus 2% FBS). 
Different PPC ratios were used for the different vectors to achieve measurable 
transduction levels. The PPC ratios were as follows: AvInBg: 500 PPC, 
AvInBgS*: 10,000 PPC, Av1 nBgFKOI S*: 20,000 PPC, Av1nBgS*PD1: 10,000 

25 PPC, and Av1 nBgFKOI S*PD1 : 20,000 PPC. Fiber knob competition was 

performed by pre-incubating cells in IM containing 16 /vg/ml of fiber knob for 10 
minutes at room temperature prior to infection with virus. Penton base peptide 
competition was performed by pre-incubating cells in IM containing 500nM 
peptide for 10 minutes at room temperature prior to infection with virus. 

30 Heparin competition was performed by pre-incubating each adenoviral vector in 
IM containing 3 mg/ml of heparin for 20 minutes at room temperature. In all 
cases, the competitor remained in the IM during the 1 hour infection when virus 
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was rocked on the cell monolayers at 37° C in 5% C02. After infection, the 
monolayers were washed with PBS, 1 ml of complete medium was added per 
well and the cells were incubated for an additional 24 hours to allow for 
/?-galactosidase expression. The cell monolayers were then fixed and stained 
5 with X-Gal. The percentage of cells transduced was determined by light 

microscopy as described above. Each condition was carried out in triplicate and 
three random fields per well were counted, for a total of nine fields per 
condition. The average percentage of transduction per high-power field was 
determined. 

10 The results of the competition experiment (Figure 13C) showed that fiber 

knob inhibited transduction of cells by all vectors except for those that contained 
the KOI mutation. The penton base peptide only inhibited transduction by 
Av1nBgFK01S*. Heparin inhibited transduction by Av1 nBgFKCM S* and 
Av1nBgFK01S*PD1, but did not affect transduction by any of the other viruses 

15 suggesting the presence of additional heparin binding sites on the adenoviral 
capsid but that the shaft contains the predominant site. 

EXAMPLE 6 

In Vivo Analysis Of Adenoviral Vectors Containing The HSP Mutation In Fiber 

The objective of this study was to evaluate the in vivo biodistribution of 
20 adenoviral vectors containing the HSP mutation and to determine whether this 
shaft modification influences adenoviral-mediated liver transduction. In addition, 
vectors containing the HSP mutation combined with KOI , or PD1 , or a 
combination of all three mutations were evaluated as well as vectors containing 
the K01 mutation alone and the PD1 mutation alone. A positive control cohort 
25 received AvInBg and a negative control group received HBSS. Cohorts of five 
C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 13 
particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 
kidney were collected from each animal. The median lobe of the liver was placed 
30 in neutral buffered formalin to preserve the sample for ^-galactosidase 

immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon real time PGR analysis to determine vector content. A 
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separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent yff-galactosidase activity assay. /?-galactosidase 
immunohistochemistry, hexon real-time PGR and the chemiluminescent 
/?-ga!actosidase activity assay were carried out as described in Example 3. 
5 The results of the jS-galactosidase activity assay (Figure 14A) and adenoviral 
hexon DNA content (Figure 14B) showed a dramatic reduction in liver 
transduction by vectors containing the HSP mutation. The vectors containing 
the HSP mutation alone resulted in reducing adenoviral-mediated liver gene 
expression by approximately 20-fold. When combined with the K01 mutation 

10 (HSP, KOI, PD1 ), yielded approximately a 1000-fold reduction in jff-galactosidase 
activity in the liver compared to the control vector AvInBg. The vector 
containing the KOI mutation alone showed a slight increase, on average, in liver 
transduction compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined with 

15 KOI showed a slight decrease in liver transduction compared to AvInBg, 
although the decrease was not statistically significant. Analysis of hepatic 
adenoviral hexon DNA content (Figure 14B) confirmed these results. 

The results of the immunohistochemical staining of liver sections for /?- 
galactosidase were consistent with the activity assays (data not shown) and 

20 demonstrated that gene expression was localized specifically to hepatocytes. 
Vectors containing the HSP mutation, either alone or in combination with KOI 
and/or PD1, showed a dramatic reduction in hepatocyte transduction. The 
vector containing the KOI mutation alone showed a slight increase in liver 
transduction as revealed by a more intense and frequent immunohistochemical 

25 staining pattern. The vectors containing the PD1 mutation, either alone or 
combined with K01, showed little difference in transduction compared to 
Av1 nBg. 

EXAMPLE 7 

Description of Adenoviral Vectors Containing the HSP Fiber Shaft Mutation with 
30 and without the KOI Fiber Mutation and with and without a cRGD Targeting 
Ligand in the Fiber Knob HI Loop 



WO 2003/062400 



PCT/US2003/002295 



-70- 

Generation of vector containing the HSP fiber shaft mutation and a cRGD ligand 
in the Hi loop: The following strategy was used to generate an adenoviral vector 
containing a fiber with the HSP shaft mutation and a cRGD ligand in the HI loop. 
The plasmid p5FloxHRFRGD was digested with the restriction enzymes BstXI 
5 and Kpnl and the 1 157 bp fragment was isolated and purified. In addition, the 
fiber shuttle plasmid pFBSEHSP, described in Example 1 above, was digested 
with BstXI and Kpnl and the 4549 bp and 3156 bp fragments were isolated and 
purified. The three fragments were ligated to generate the plasmid 
pFBSEHSPRGD, which encodes a fiber containing the HSP mutation and cRGD in 

10 the HI loop. The fiber gene from this plasmid was transferred into the pSQ1 

backbone as follows. The plasmid pFBSEHSPRGD was digested with EcoRI and 
Xbal and the 7601 bp fragment was isolated and purified. The plasmid pSQ1 
(Figure 3B) was digested with the restriction enzymes EcoRI, Ndel, and Xbal and 
the 16,431 bp EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment 

15 were isolated and purified. The three DNA fragments were ligated to generate 
the plasmid pSQIHSPRGD (Figure 15A). 

To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQ1 HSPRGD was digested with Clal and co-transfected into 633 cells with 

20 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation 
and a cRGD ligand, the viral preparation was used to infect PerC6 cells, which 
do not express fiber. The resulting virus, termed Av1 nBgS*RGD, was purified 

25 by standard CsCI centrifugation procedures. 

Generation of vector containing the HSP fiber shaft mutation, the KOI fiber knob 
mutation, and a cRGD ligand in the HI loop 

The following strategy was used to generate an adenoviral vector 

containing a fiber with the HSP shaft mutation, the KOI fiber knob mutation, 

30 and a cRGD ligand in the HI loop. The plasmid p5FloxHRFRGD was digested 

with the restriction enzymes BstXI and Kpnl and the 1 1 57 bp fragment was 

isolated and purified. In addition, the fiber shuttle plasmid pFBSEHSPKOI, 
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described in Example 1 above, was digested with BstXI and Kpnl and the 4549 
bp and 3156 bp fragments were isolated and purified. The three fragments were 
ligated to generate the plasmid pFBSEHSPKOI RGD, which encodes a fiber 
containing the HSP mutation, the KOI mutation, and cRGD in the HI loop. The 
5 fiber gene from this plasmid was transferred into the pSQ1 backbone as follows. 
The plasmid pFBSEHSPKPOl RGD was digested with EcoRI and Xbal and the 
7601 bp fragment was isolated and purified. The plasmid pSQ1 (Figure 3B) was 
digested with the restriction enzymes EcoRI, Ndel, and Xbal and the 16,431 bp 
EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment were isolated 

10 and purified. The three DNA fragments were ligated to generate the plasmid 
pSQIHSPKOIRGD (Figure 15B). 

To generate a recombinant adenoviral vector containing the HSP and KOI 
mutations in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQIHSPKOIRGD was digested with Clal and co-transfected into 633 cells with 

15 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP and KOI 
mutations and a cRGD ligand, the viral preparation was used to infect PerC6 
cells, which do not express fiber. The resulting virus, termed 

20 Av1nBgFK01S*RGD, was purified by standard CsCI centrifugation procedures. 

EXAMPLE 8 

In Vitro Evaluation of Adenoviral Vectors Containing the HSP Fiber Shaft 
Mutation with or without the Fiber Knob KOI Mutation and with or without a 
cRGD Ligand in the HI Loop 

25 The transduction efficiencies of adenoviral vectors containing the HSP 

fiber shaft mutation with or without the fiber KOI mutation and with or without 
the cRGD ligand in the HI loop were evaluated on A549 cells. The transduction 
efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber. The day prior to infection, cells were seeded into 24-well plates 

30 at a density of approximately 1 x 10 5 cells per well. Immediately prior to 
infection, the exact number of cells per well was determined by counting a 
representative well of cells. Each of the vectors, AvInBg, AvInBgS*, 
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Av1 nBgFKOI S*, Av1 nBgS*RGD, and Av1 nBgFKOI S*RGD, were used to 
transduce A549 cells at a particle to cell ratio of 6250. The cell monolayers 
were stained with X-gal 24 hours after infection and the percentage of cells 
expressing yS-galactosidase was determined by microscopic observation and 
5 counting of cells. Transductions were done in triplicate and three random fields 
in each well were counted, for a total of nine fields per vector. The results 
(Figure 16) showed that the cRGD ligand dramatically increased the transduction 
efficiencies of vectors containing the HSP mutation alone or combined with the 
K01 mutation. AvInBgS* yielded approximately 22% positive cells, while 
10 Av1nBgS*RGD yielded approximately 95% positive cells. Similarly, 

Av1 nBgFKOI S* yielded only 4% positive cells, while Av1 nBgFKOI S*RGD 
yielded 85% positive cells. Therefore, the vector containing the shaft mutation 
is viable and can be retargeted with the addition of a ligand. 

EXAMPLE 9 

15 Construction Of Ad5 Vectors Containing The Ad35 Fiber And Derivatives 
Thereof 

The KOI and HSP mutations in the Ad5 fiber protein (5F), described 
above, were designed to ablate interactions that are responsible for the normal 
tropism of the Ad5 virus. An alternative strategy to detarget the virus is to 

20 replace the Ad5 fiber with a fiber from another serotype which does not bind 

CAR and which does not possess the heparin sulfate proteoglycan (HSP) binding 
domain (KKTK; SEQ ID No. 1) within the shaft. The fiber of adenovirus serotype 
35 (35F) does not bind CAR and does not possess the HSP binding domain in its 
shaft. Replacement of the 5F with the 35F can detarget the liver and provide a 

25 suitable platform for retargeting the vector to the desired tissue. 

Generation of an Ad5 based vector containing the Ad35 fiber: A PCR 
SOEing strategy was used to generate a vector based on the Ad5 serotype but 
containing the Ad35 fiber in place of the Ad5 fiber. First, PCR was used to 
amplify a region in the plasmid pSQ1 between the Xbal site at bp 25,309 and 

30 the start of the fiber gene. The primers used for this reaction were P-0005/U 
and P-0006/L. The DNA sequence of P-0005/U was as follows: 5' C TCT AGA 
AAT GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 17). This sequence 
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corresponds to bp 25,308 through 25,334 of pSQ1 . The DNA sequence of 
P-0006/L was as follows: 5' TCT TGG TCA TCT GCA ACA ACA TGA AGA TAG 
TG 3' (SEQ ID No. 18). it contains a 10 base pair 5' extension that is 
complementary to the start of the Ad35 fiber gene, while the remainder of the 
5 primer anneals to the sequence immediately 5' of the ATG start codon of the 
fiber gene in pSQ1 . A PGR product of the expected size, 583 bp, was obtained 
and the DNA was gel purified. A second PCR amplified the Ad35 fiber gene 
using DNA extracted from wildtype Ad35 virus as a template. The primers used 
for this reaction were P-0007/U and 35FMun. The DNA sequence of P-0007/U 
10 ( was as follows: 5' GT TGT TGC AG ATG ACC AAG AGA GTC CGG CTC A 3' 
(SEQ ID No. 19). It contains a 10 base pair 5' extension that is homologous to 
the 10 bp immediately prior to the ATG start codon of the fiber gene in Ad5. 
The remainder of the primer anneals to the start of the Ad35 fiber gene. The 
DNA sequence of 35FMun was as follows: 5' AG CAA TTG AAA AAT AAA 
1 5 CAC GTT GAA ACA TAA CAC AAA CG A TTC TTT A GTT GTC GTC TTC TGT 
AAT GTA AGA A 3' (SEQ ID No. 20). It contains a 46 base pair 5' extension 
that is complementary to the region of the Ad5 genome between the end of fiber 
and the Muni site 40 bp downstream of the fiber gene. In addition, the 5' 
extension encodes the last amino acid and stop codon of the Ad5 fiber gene. 
20 This region was retained in the vector because it contains the polyadenylation 
site for the fiber gene. The remainder of the primer anneals to the 3' end of the 
Ad35 fiber gene, up to the next to last amino acid codon. A PCR product of the 
expected size, 1027 bp, was obtained and the DNA was gel purified. The two 
PCR products were mixed and joined together by PCR SOEing using primers 
25 P-0005/U and P-0009. The DNA sequence of P-0009 was as follows: 5' AG 

CAA TTG AAA AAT AAA CAC GTT G 3' (SEQ ID No. 21). It corresponds to bp 
27,648 through 27,669 of pSQ1 and overlaps the Muni site in that region. A 
PCR product of the expected size, 1590 bp, was obtained and gel purified. It 
was cloned into the plasmid pCR4blunt-TOPO (Invitrogen Corporation, Carlsbad 
30 CA) using the Zero Blunt TOPO PCR Cloning Kit from Invitrogen. This 

intermediate cloning step simplified DNA sequencing of the PCR SOEing product. 
The resulting plasmid, termed pTOPOAd35F, was digested with Xbal and Muni 
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and the 1 585 bp digestion product was gel purified and ligated with the 6477 bp 
fragment of pFBshuttle(EcoRI) digested with Xbal and Muni to generate the 
plasmid pFBshuttleAd35F. The Ad35 fiber gene was transferred from 
pFBshuttleAd35F into pSQ1 as follows. The plasmid pSQ1 was digested with 
5 EcoRI and the 24,213 bp fragment was gel purified. The plasmid 

pFBshuttleAd35F was linearized with EcoRI and ligated with the 24,213 bp 
fragment from pSQ1 . Restriction diagnostics were performed to screen for 
constructs containing the Ad35 fiber gene inserted into the pSQ1 backbone in 
the correct orientation. The pSQ1 plasmid containing the Ad35 fiber gene in the 

10 proper orientation was termed pSQ1 Ad35Fiber (Figure 17A). To generate 

adenoviral vector containing the Ad35 fiber, pSQ1 Ad35Fiber was digested with 
Clal and co-transfected into 633 cells with pAdmireRSVnBg which had been 
digested with Sail and Pad. After propagation on 633 cells, the resulting virus 
contained Ad5 fiber and Ad35 fibers on its capsid. The virus was amplified on 

15 PerC6 cells to generate virus containing only the Ad35 fiber on its capsid. The 
resulting virus preparation was termed Av1nBg35F. 

Construction of adenoviral vectors containing chimeric fibers derived from 
Ad5 and Ad35: Two chimeric fiber constructs were prepared by PCR gene 
overlap extension using plasmids containing the full length Ad5 or Ad35 fiber 

20 cDNAs as templates. The Ad5 fiber tail and shaft regions (5TS; amino acids 1 
to 403) were connected with the Ad35 fiber head region (35H; amino acids 137 
to 323) to form the 5TS35H chimera, and the Ad35 fiber tail and shaft regions 
(35TS; amino acids 1 to 136) were connected with the Ad5 fiber head region 
(5H; amino acids 404 to 581 ) to form the 35TS5H chimera. The fusions were 

25 made at the conserved TLWT sequence at the fiber shaft-head junction. 

For the construction of the 5TS35H chimera, the pFBshuttle(EcoR1 ) 
plasmid was used as the template with primers P1 and P2 to generate the 5' 
fragment. The 3' fragment was generated using the pFBshuttleAd35 plasmid as 
the template with the P3 and P4 primers. The sequence of each primer used in 

30 the construction of these chimeric fibers is listed in Table 2. Amplified PCR 
products of the expected size were obtained and were gel purified. A second 
PCR was carried out with the end primers P1 and P4 to join the two fragments 
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together. The DNA fragment generated in the second PCR was digested with 
Xba1 and Muni and was cloned directly into pFBshuttle(EcoR1 ) to create the 
fiber shuttle plasmid pFBshuttle5TS35H. 

TABLE 2 

5 Primers Used For The Exchange Of Fiber Shaft Regions Between Ad5 And Ad35 

Fibers 



Primer 
rip^ia nation 


Sequence 


SEQ ID 


P1 


5'-GAACAGGAGGTGAGCTTAGA-3' 


22 


P2 | 


5'-GTTAGGTGGAGGGTTTATTCCGGTCCAC 
AAAGTTAGCTTATC-3' 


23 


P3 


5'-GATAAGCTAACTTTGTGGACCGGAATAAA 
CCCTCCACCTAAC-3' 


24 


P4 


5'-GTGGCAGGTTGAATACTAGG-3 


25 


P5 


5'-GTTAGGAGATGGAGCTGGTGTAGTCCATA 
AGGTGTTAATAC-3' 


26 


P6 


5'-GTATTAACACCTTATGGACTACACCAGCT 
CCATCTCCTAAC-3' 


27 


P7 


5'-TGCGCAAAAACAATCACCACGACAATCACAAT 
GTACATTGGAAGAAATCATACG-3' 


28 


P8 


5'-ACATTGTGATTGTCGTGGTGATT 
GTTTTTGCGCATATGCCATACAA I I I GAATG-3' 


29 



For the construction of the 35TS5H chimera, the pFBshuttleAd35 plasmid 
was used as the template with the P1 and P5 primers to generate the 5' 
fragment. The 3' fragment was generated using the pFBshuttle(EcoR1 ) plasmid 
as the template with the P6 and P4 primers. Following the same procedure 
described above, the fiber shuttle plasmid pFBshuttle35TS5H was generated. 

For the 35TS5H and 5TS35H chimeras, the fiber gene was transferred 
from the pFBshuttle(EcoRI) backbone into pSQ1 as described above for the 
vector containing the Ad35 fiber. The resulting plasmids were called 
pSQ135T5H (Figure 18A) and pSQ15T35H (Figure 18B). In addition, adenoviral 
vectors were generated using the co-transfection strategy described above. 

Construction of Ad5 vectors containing the Ad35 fiber with a cRGD 
targeting peptide in the HI loop of the 35F fiber knob: To incorporate the cRGD 
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targeting peptide into the Ad35 fiber HI loop, the P7 and P8 oligonucleotide 
primers encoding the ten amino acid sequence HCDCRGDCFC (SEQ ID No. 30) 
were synthesized. The pFBshuttleAd35 plasmid containing the full length Ad35 
fiber cDNA was used as the template in the PGR reaction with the P1 and P7 
5 primer pair or with the P4 and P8 primer pair in order to generate the 5' and 3' 
PGR fragments. A second PGR was then carried out with the end primers P1 
and P4 to join the two fragments together. The resulting PCR fragment was 
digested with Xba1 and Muni and was cloned into pFBshuttle(EcoR1 ) to create 
the fiber shuttle plasmid pFBshuttleAd35cRGD. The modified Ad35 fiber gene 
10 was transferred into pSQ1 using the EcoRl cloning strategy described above to 
generate pSQ1 Ad35FcRGD (Figure 17B). Adenoviral vector was generated 
using the co-transfection strategy described above. 

EXAMPLE 10 

In Vitro Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
1 5 Thereof 

The transduction efficiencies of adenoviral vectors containing the 35F or 
derivatives thereof were evaluated on A549 cells. The transduction efficiencies 
were compared to that of AvInBg, an adenoviral vector containing the 5F fiber. 
The day prior to infection, cells were seeded into 24-well plates at a density of 

20 approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1nBg35F, Av1nBg5T35H and 
Av1nBg35T5H were used to transduce A549 cells from 0 up to 1,000 particle 
per cell (PPG) ratios. The cell monolayers were stained with X-gal 24 hours after 

25 infection and the percentage of cells expressing /?-galactosidase was determined 
by microscopic observation and counting of cells. Transductions were done in 
triplicate and three random fields in each well were counted, for a total of nine 
fields per vector. The results (Figure 19) showed similar transduction 
efficiencies on A549 cells using the Av1nBg35F and Av1nBg5T35H vectors 

30 compared to AvInBg. The Av1nBg35T5H showed much lower transduction 
efficiencies on A549 cells compared to Av1 nBg as a result of the Ad35 shaft 
domain. The Ad35 shaft domain does not contain a HSP binding motif and the 
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AvlnBg35T5H vector behaves similarly to the AvInBgS* vector in vitro and in 
vivo. These studies also demonstrate that vectors containing fiber proteins 
without an HSP binding site are fully viable. 

EXAMPLE 1 1 

5 in Vivo Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
Thereof 

The objective of this study was to evaluate the in vivo biodistribution of 
adenoviral vectors containing 35F fibers and derivatives thereof to determine 
whether vectors containing these fibers ablate liver transduction due to their 

10 shaft regions. A positive control cohort received AvInBg and a negative control 
group received HBSS. Cohorts of five C57BL/6 mice received each vector via 
tail vein injection at a dose of 1 x 10 13 particles per kg. The animals were 
sacrificed approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 

15 animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for /?-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 
determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent /?-galactosidase activity assay, /?- 

20 galactosidase immunohistochemistry, hexon real-time PCR and the 

chemiluminescent ^-galactosidase activity assay were carried out as described in 
example 3. 

The results of the /^-galactosidase activity assay showed a dramatic 
reduction in liver transduction by vectors containing the Ad35 fiber or the 
25 35T5H derivative (Figure 20) with an approximately 4- to 24-fold reduction in /?- 
galactosidase activity in the liver compared to the control vector AvInBg. These 
data demonstrate that shaft domains without HSP binding sites can effectively 
ablate hepatic in vivo gene transfer. In particular, HSP is the major entry 
mechanism for liver in vivo. CAR binding is a minor entry pathway. 
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EXAMPLE 12 

Construction Of Ad5 Vectors Containing The Ad Serotype 41 Short Fiber And 
Derivatives Thereof 

The human adenovirus serotype 41 contains two different fibers on its 

5 capsid, encoded by two adjacent genes. One fiber has a molecular weight of 

60kDa and is approximately 315A in length and is termed the long fiber. The 

other fiber has a molecular weight of 40kDa and is approximately 250-)- in 

length and is termed the short fiber. The Ad41 short fiber does not bind CAR 

and does not possess the heparin binding domain (KXTK) in its shaft. Therefore, 

10 this fiber provides a useful platform for adenoviral vector targeting. 

Construction of adenoviral vectors based on Ad5 but containing the Ad41 
short fiber: A PCR SOEing strategy was used to generate a vector based on the 
Ad5 genome but containing the Ad41 short (Ad41s) fiber. First, PCR was used 
to amplify the region of pSQl between the Xbal site at bp 25,309 and the start 

15 of the fiber gene. The primer pair used for the PCR were P-0005/U and 

P-0010/L. The DNA sequence of P-0005/U was as follows: 5' C TCT AGA AAT 
GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 17). The sequence corresponds to 
bp 25,308 through 25,334 of pSQ1 and overlaps the Xbal site in that region. 
The DNA sequence of P-0010/L was as follows: 5' TTC TTT TCA T CTG CAA 

20 CAA CAT GAA GAT AGT G 3' (SEQ ID No. 31 ). It contains a 5' extension 

corresponding to the first 10 bp of the Ad41s fiber gene. The remainder of the 
primer anneals to pSQ1 immediately 5' of the ATG start codon of the fiber gene. 
The PCR product was the expected size (583 bp). A second PCR was used to 
amplify the Ad41s fiber using the plasmid pDV60Ad41sF as a template. The 

25 primers used were P-001 1/U and P-0012/L. The DNA sequence of P-001 1/U 
was as follows: 5' GT TGT TGC AG ATG AAA AGA ACC AGA ATT GAA G 3' 
(SEQ ID No. 32). It contains a 10 bp 5' extension corresponding to the DNA 
sequence immediately 5' of the ATG start codon of the fiber gene in pSQ1 . The 
remainder of the primer anneals to the beginning of the Ad41s fiber gene in 

30 pDV60Ad41sF. The DNA sequence of P-0012/L was as follows: 5' TG CAA 

TTG AAA AAT AAA CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT ATT C 
TTC AGT TAT GTA GCA AAA TAC A 3' (SEQ ID No. 33). It contains a 51 bp 
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5' extension corresponding to the sequence in pSQ1 from the last codon of the 
fiber gene through the Mun! site 40 bp downstream of the fiber gene. The 
remainder of the primer anneals to the 3' end of the Ad41s fiber gene in 
pDV60Ad41sF. The PCR product was the expected size (1219 bp). The two 
5 PCR products were joined by PCR SOEing using primers P-0005/U and P-0009/L. 
The DNA sequence of P-0009/L was described above. The PCR SOEing reaction 
yielded the expected 1782 bp product. The product was cloned into 
pCR4blunt-TOPO to yield pCR4blunt-TOPOAd41 sF. Next, 
pCR4blunt-TOPOAd41sF was digested with Xbal and Muni and the 1773 bp 

10 fragment containing the Ad41s fiber gene was gel purified. This fragment was 
ligated with the 6477 bp Xbal to Muni fragment of pFBshuttle(EcoRI) to generate 
pFBshuttleAd41sF. The Ad41s fiber gene was transferred into the pSQ1 
backbone as follows. First, pFBshuttleAd41 sF was linearized using EcoRI and 
this fragment was ligated with the 24,213 bp EcoRI fragment of pSQ1 to 

15 generate pSQ1Ad41sF (Figure 21 A). Adenoviral vector containing the Ad41s 
fiber was generated using the co-transfection strategy described above. 

Construction of Ad5 adenoviral vectors containing the Ad41 short fiber 
with a cRGD targeting ligand in the HI loop: A PCR SOEing strategy was used to 
generate a construct containing the Ad41s fiber with cRGD in the HI loop. The 

20 plasmid pFBshuttleAd41 sF was used as a template for the PCR amplifications. 
First, a 1782 bp fragment was amplified using primers 5FF and 41 sRGDR. The 
primer 5FF was described above. It anneals to pFBshuttleAd41 sF at the Xbal 
site upstream of the fiber gene. The DNA sequence of the primer 41 sRGDR was 
as follows: 5' AGT ACA AAA ACA ATC ACC ACG ACA ATC ACA GTT TAT 

25 CTC GTT GTA GAC GAC ACT GA 3' SEQ ID No. 34). It contains a 30 bp 5' 

extension that encodes the cRGD targeting ligand. The remainder of the primer 
anneals to pFBshuttleAd41 sF from bp 2878 through 2903. A second PCR 
amplified a 277bp region of pFBshuttleAd41 sF using primers 3FR and 41sRGDF. 
The primer 3FR was described previously. It anneals to pFBshuttleAd41 sF at the 

30 Muni site downstream of the fiber gene. The DNA sequence of 41 sRGDF was 
as follows: 5' TGT GAT TGT CGT GGT GAT TGT TTT TGT ACT AGT GGG TAT 
GCT TTT ACT TTT 3' (SEQ ID No. 35). It contains a 30 bp 5' extension that 
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encodes the cRGD targeting ligand and is complementary to the extension on 
41 sRGDR. The remainder of the primer anneals to pFBshuttleAd41 sF from bp 
2904 through 2924. The two PCR products were joined by PGR SOEing to 
generate a 2059 bp fragment using primers 5FF and 3FR. The product was 
5 digested with Xbal and Muni and the 1803 bp DNA fragment was gel purified. 
The fragment was ligated with the 6477 bp fragment resulting from digestion of 
pFBshuttle(EcoRI) with Xbal and Muni. The resulting plasmid was termed 
pFBshuttleAd41 sRGD. This plasmid was linearized by EcoRI digestion and 
ligated with the 24,21 3bp EcoRI fragment of pSQ1 to generate pSQ1Ad41sRGD 
10 (Figure 21 B). 

EXAMPLE 13 

In Vivo Evaluation Of Ad5 Vectors Containing The Ad41 Short Fiber And 
Derivatives Thereof 

This example evaluates the in vivo biodistribution of adenoviral vectors 

15 containing 41 sF fibers and derivatives thereof to determine whether vectors 
containing the these fibers ablate liver transduction due to modified shaft 
regions. A positive control cohort received Av3nBg (see, Gorziglia eta/. (1996) 
J, Virology 70:4173-4178) or Ad5.jffGal.AF/5F, and a negative control group 
received HBSS. Ad5.yffGal.AF/5F is a derivative of the fiberless vector 

20 Ad5.jffgal.AF (ATCC accession number VR2636) modified to express AD5 fiber 
(see, e.g., International PCT application No. WO01 83729). 

The Ad5.yffGal.AF vector was pseudotyped with the Ad41sF fiber protein 
and injected in vivo. Cohorts of five C57BL/6 mice received each vector via tail 
vein injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 

25 approximately 72 hours after vector administration by carbon dioxide 

asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 
animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for yff-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 

30 determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent yff-galactosidase activity assay. JS- 
galactosidase immunohistochemistry, hexon real-time PCR and the 
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chemiluminescent /?-galactosidase activity assay was carried out as described in 
example 3. 

The results of the hexon DNA analysis showed a dramatic reduction in 
liver transduction by vectors containing the Ad41sF fiber (Figure 22) with an 
5 approximately a 5-fold reduction in liver adenoviral DNA content compared to 
either control vector. 

In the above examples, several novel adenoviral vectors were generated 
containing various fiber modifications designed to ablate the normal tropism of 
the vector. See Table 3. Vectors were generated in which the heparan sulfate 

10 binding domain in the fiber shaft was replaced by amino acid substitutions. This 
mutation, termed HSP, was also combined with the KOI mutation (fiber knob AB 
loop mutation that ablates CAR binding), and the PD1 mutation (penton mutation 
that eliminates RGD/integrin interaction). In addition, a vector containing all three 
mutations (HSP, KOI, PD1) was generated. All vectors containing the HSP 

15 mutation, either alone or combined with other capsid modifications, showed 
dramatically reduced transduction efficiencies on A549 and HeLa cells. 
Furthermore, the same vectors showed dramatically reduced transduction of the 
liver following systemic delivery to mice. As an alternative strategy to ablate the 
normal tropism of Ad5-based vectors, the Ad5 fiber was replaced by a fiber from 

20 a different adenovirus serotype which does not bind CAR and does not contain 
the heparan binding domain in the shaft. Thus, vectors were generated 
containing the Ad35 fiber and the Ad41 short fiber. Versions of these two 
vectors containing a cRGD targeting ligand in the HI loop of the fiber were also 
produced. Additionally, vectors containing chimeric fibers were generated. A 

25 vector containing the Ad35 fiber tail and shaft regions fused to the Ad5 fiber 

knob domain as well as a vector containing the Ad5 fiber tail and shaft fused to 
the Ad35 fiber knob domain were constructed. Vectors containing either the 
entire Ad35 or Ad41 short fiber showed a significant reduction in liver 
transduction following delivery to mice via the tail vein. The observation of 

30 reduced liver transduction using vectors containing either an HSP mutation, the 
Ad35 fiber, or the Ad41 short fiber indicates the feasibility of detargeting 
adenoviral vectors in vivo. In vitro data with the Ad35 fiber or the Ad41 short 



WO 2003/062400 



PCT/US2003/002295 



-82- 

fiber with cRGD (see Example 14) indicate that the virus is completely viable, 
that is, it is not damaged by the absence of an HSP binding site and is 
retargetable. Taken together these data suggest that these vectors provide a 
suitable platform for retargeting strategies. 
5 TABLE 3 



Description Of Recombinant Adenoviral Vectors Used 
To Demonstrate That Shaft Modifications Influence Tropism In Vivo 

Vector 



Vector 


Description 


Av1 nBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing /?-ga!actosidase 


Ad5 Fiber derivatives: 


AvInBgFKOI 


The same as AvInBg but containing the K01 AB loop 
mutation in the fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the penton PD1 mutation 
that deletes the integrin binding, RGD tripeptide 


AvInBgS* 


The same as AvInBg but containing the 4 amino acid 
substitution in the shaft referred to as S* that modifies the 
HSP binding motif 


AvInBgFKOIS* 


The same as AvInBg but containing the fiber KOI and S* 
mutations combined 


Av1nBgS*PD1 


The same as AvInBg but containing the fiber S* and penton 
PD1 mutations combined 


Av1nBgFK01S*PD1 


The same as AvInBg but containing the fiber KOI, S* and 
penton PD1 mutations combined 


Ad35 fiber derivatives: 


Av1nBg35F 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA 


Av1nBg5T35H 


The same as AvInBg but containing the 5T35H chimeric fiber 


Av1nBg35T5H 


The same as AvInBg but containing the 35T5H chimeric fiber 


Av1nBg35FRGD 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA with a cRGD ligand in the HI loop of the Ad35 fiber 


Ad41sF fiber derivatives: 


Av1nBg41sF 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA 
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Vector 


Description 


Av1nBg41sFRGD 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA with a cRGD ligand in the HI loop of the Ad41 
short fiber 



EXAMPLE 14 

in Vitro Evaluation Of Adenoviral Vectors Containing The Ad41sF With A cRGD 
5 Ligand In The HI Loop 

The transduction efficiencies of adenoviral vectors containing the Ad41sF 

fiber with the cRGD ligand in the HI loop were evaluated on A549 cells. The 

transduction efficiencies were compared to that of AvInBg, an adenoviral vector 

containing wild type fiber or Av1 nBgFKOI RGD, an adenoviral vector containing 

10 the KOI mutation in combination with the cRGD ligand in the HI loop. The day 
prior to infection, cells were seeded into 24-well plates at a density of 
approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1 nBgFKOI RGD, and Av1 nBg41 sFRGD 

15 were used to transduce A549 cells at a particle to cell ratios of 0 up to 10,000. 
The cell monolayers were stained with X-gal 24 hours after infection and the 
percentage of cells expressing /?-galactosidase was determined by microscopic 
observation and counting of cells. Transductions were done in triplicate and 
three random fields in each well were counted, for a total of nine fields per 

20 vector. The results (Figure 23) show that the Av1 nBg41 sFRGD vector 

transduced cells to an equivalent level as Av1 nBgFKOI RGD at all vector doses 
examined. Neither FKOI or Ad41sF can bind CAR. The Ad41sF does not 
normally interact with CAR and additionally does not contain the HSP binding 
motif within the shaft domain. These data show that targeting peptides inserted 

25 into the loop regions of the fiber knob of K01 and Ad41sF allows for 

transduction of target cells via the targeted receptor. Surprisingly, HSP, not 
CAR and integrins, is the major entry route in vivo and ablation of HSP binding 
permits targeting of adenoviral vectors. 
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EXAMPLE 15 

Effect of the shaft modification on the biodistribution of adenoviral vectors in 
vivo 

The influence of fiber and penton modifications on the in vivo 
5 biodistribution of adenoviral vectors containing fiber head, shaft and penton 
mutations was examined. Vectors containing the HSP mutation combined with 
KOI , or PD1 , or a combination of all three mutations were evaluated as well as 
vectors containing the KOI mutation alone and the PD1 mutation alone. The 
indicated adenoviral vectors were systemically administered to C57BL6 mice as 

10 described above. A positive control cohort received AvlnBg and a negative 
control group received HBSS. Cohorts of five C57BL/6 mice received each 
vector via tail vein injection at a dose of 1 x 10 13 particles per kg. The animals 
were sacrificed approximately 72 hours after vector administration by carbon 
dioxide asphyxiation. Liver, heart, lung, spleen, and kidney were collected from 

15 each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to determine adenoviral hexon DNA content. A separate sample of liver 
from each mouse was frozen to preserve it for a chemiluminescent /?- 
galactosidase activity assay. Hexon real-time PCR and the chemiluminescent /?- 
galactosidase activity assay was carried out as described in Example 3. 

20 The results derived from the liver are described in Example 6 {Figure 14A 

and B) and also shown in Figure 26 with results presented as percent control of 
AvlnBg. The effect of the S* shaft modification on the biodistribution of 
adenovirus to the other organs is shown in Figure 25. The average adenoviral 
DNA content was determined as adenoviral genomic copies per cell and 

25 expressed as a percentage of the AvlnBg ( + ) control value. The average 

percent control value + standard deviation is shown (n = 5 per group) for each 
tissue examined (Figure 25). Systemic delivery of Ad5 based vectors with 

wild-type fiber results in a preferential accumulation of vector DNA in the liver 
with 64 copies per cell with significantly less DNA found in the other organs 

30 with 1 .32 copies per cell found in lung, 2.18 copies per cell in spleen, 0.47 
copies per cell found in heart, and 0.72 copies per cell in the kidney. All 
differences found with PD1, S*, K01PD1, KQ1S*, S*PD1, and K01S*PD1 were 
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significantly different than the AvInBg { + ) control using a unpaired, t-test 
analysis, P value { 0.024. When expressed as a percent of the AvInBg control 
values, the influence of each mutation, individually or in combination, becomes 
apparent. The S* mutation dramatically reduced gene transfer to all four organs, 
5 whereas, the KOI mutation did not. Thus, the importance of the shaft for 

transduction in vivo extends to organs besides the liver. Finally, gene transfer to 
the lung, heart, and kidney was diminished with PD1 suggesting a role for 
integrin binding in vector entry in these organs. 

EXAMPLE 16 

10 Retargeting the S*, shaft modification and the 41sF fiber in vivo 

Vectors containing the HSP mutation have been shown to effectively 
detarget adenoviral vectors in vivo (see examples 6 and 15). The objective of 
this study was to evaluate the ability to retarget vectors containing the S* 
modification or the Ad41sF to tumors in vivo. A cRGD peptide was genetically 

15 incorporated into the fiber HI loop and evaluated in vitro (Examples 8 and 14). 

These same vectors were then evaluated in vivo in tumor-bearing mice. Athymic 
nu/nu female mice were injected with 8 x 10 6 A549 cells on the right hind flank. 
When tumors reached approximately 1 00mm3 in size, they were randomized 
into treatment groups. Cohorts of 6 mice received each vector via tail vein 

20 injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Tumor, liver, heart, lung, spleen, and kidney were collected from 
each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to determine adenoviral hexon DNA content. Hexon real-time PCR was 

25 carried out as described in example 3. A separate sample of liver from each 

mouse was frozen to preserve it for a chemiluminescent /?-galactosidase activity 
assay. Hexon real-time PCR and the chemiluminescent /?-galactosidase activity 
assay was carried out as described in example 3. 

The adenoviral vector biodistribution to the liver and tumor for each 

30 treatment group is shown in Figure 27. Vectors containing the S*, KOIS*, and 
41 sF fibers effectively detargeted the liver and tumor resulting in a significant 
reduction in the amount of adenoviral DNA found in each tissue in comparison to 
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the AvInBg control. Vectors containing the cRGD targeting ligand restored 
tranduction of the tumors to levels comparable to that achieved with the 
untargeted vector. 

These data demonstrate successful liver detargeting accompanied with 
5 tumor retargeting. The extent of tumor retargeting is relates to the affinity and 
type of ligand that is used. These data demonstrate the successful 
development of a targeted, systemically deliverable adenoviral vector that will 
target tumors in vivo. 

EXAMPLE 17 

10 Scale-Up Method For The Propagation Of Detargeted Adenoviral Vectors 

The growth and propagation of doubly or triply ablated adenoviral vectors 
requires novel scale up technologies. These detargeted vectors require 
alternative cellular entry strategies to allow for the efficient growth and 
generation of high titer preparations. A strategy for vector growth that is 

15 generally applicable to all detargeted adenoviral vectors, that does not require 
the development of new cell lines, and that aslo can be used for generating 
targeted vectors is provided herein. 

Three recombinant adenoviral vectors were prepared that contain single 
mutations in the fiber or penton or both mutations combined into one vector. 

20 These vectors are designated Av3nBgFK01, Av1 nBgPDI , and Av1 nBgFKOI PD1 , 
respectively. The construction of these vectors is described above and a general 
description of each vector can be found in Table 1 above. 

Scale-up of detargeted adenoviral vectors: A polycation, specifically 
hexadimethrine bromide was obtained from Sigma Chemical Co (St. Louis, MO), 

25 Catalog No. 52495, and was maintained in the medium at 4 //g/ml during the 
course of transfections and infections. To illustrate the affects of 
hexadimethrine bromide on the yield of detargeted adenoviral vectors the 
following experiment was carried out. Seven plates of AE1-2a adenoviral 
producer cells (Gorziglia et aL (1996) J. Virology 70:4173-4178) were 

30 transduced with 10 particles per cells of each of the indicated vectors (See Table 
4). Each vector was incubated with medium (Richters with 2% HI-FBS) 
containing hexadimethrine bromide at 4 /vg/ml for 30 min at room temperature 
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prior to infection. The infection was carried out for 2 hrs. Complete medium 
containing hexadimethrine bromide at 4 //g/ml was added to each plate. Final 
concentration of hexadimethrine bromide in all of these experiments was 
maintained at 4 >c/g/ml. The titers were determined spectrophotometrically using 
5 the conversion of 10D at A260nm per 1 x 10 12 particles (Mittereder et al. 

(1996) J Virology 70:7498-7509). The total particle yield was then normalized 
for the number of plates used for transduction. 

The inclusion of hexadimethrine bromide in the medium during the course 
of infection allows for the efficient propagation of detargeted adenoviral vectors 

10 containing fiber and penton mutations either alone or in combination. The affect 
of hexadimethrine bromide on vector yields is shown in Table 4. A 35-fold 
improvement in the yield of Av3nBgFK01 was found when hexadimethrine 
bromide was included in the culture medium and resulted in increasing the vector 
yield from 1.3 x 10 10 up to 4.6 x 10 11 vector particle per plate. Hexadimethrine 

15 bromide has a minimal effect on the yield of the Av1nBgPD1 adenoviral vector 
containing the penton, PD1 mutation with only a 1.2 fold improvement. The 
greatest effect using hexadimethrine bromide was found on the propagation of 
the doubly ablated adenoviral vector, Av1 nBgFKOI PD1 with increases in vector 
yield from barely detectable levels up to 4.53 x 10 10 vector particles per plate. 

20 These data demonstrate that use of nonspecific entry mechanisms allows for the 
efficient scale-up of detargeted adenoviral vectors. 



TABLE 4 

Efficient Scale-Up Of Detargeted Adenoviral Vectors Using hexadimethrine 

bromide 



Vector 


Vector Yield (particles/plate) 


Fold 
Improvement 


(-) hexadimethrine 
bromide 


{ + ) hexadimethrine 
bromide 


AvInBg 


3.89 x 10 11 


5.72 x 10 11 


1.47 


Av3nBg 


8.58 x 10 10 


2.38 x 10 11 


2.77 


Av3nBgFK01 


1.30 x 10 10 


4.60 x 10 11 


35.4 


Av1nBgPD1 


1.95 x 10 11 


2.40 x 10 11 


1.23 


Av1 nBgFKOI PD1 


TLTC* 


4.53 x 10 10 


t 
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*TLTC: Too low to count, a faint virus band was collected and the particle 
concentration was too dilute for titer determination. 
f Significant improvement 

The use of alternative polycations including protamine sulfate and 

5 poly-lysine as well as bifunctional proteins such as the anti-penton:TNFa fusion 

protein was investigated. Figure 24 show results that demonstrate all the 

reagents tested had some effect on enhancing transduction of the Av3nBgFK01 

vector. All of these compounds, when maintained in the medium during 

infection, enhanced transduction of the Av3nBgFK01 detargeted adenoviral 

10 vector. 

Bifunctional reagents:. The use of bifunctional reagents for the 
propagation of detargeted adenoviral vectors was examined using the 
anti-penton:TNFcr fusion protein. This particular reagent is a fusion protein 
between an antibody against Ad5 penton and the TNFa protein that is produced 

15 using stably transfected insect cells. This reagent will bind specifically to the 
adenoviral capsid via penton base and allow for binding to cell surface TNF 
receptors. The use of this reagent for the propagation of detargeted vectors is 
illustrated in Table 5 using Av3nBgFK01 (also shown in Figure 24). Monolayers 
of S8 cells were infected with 10 or 100 particles per cell of AvSnBgFKOI or a 

20 control vector in the presence or absence of 1 /vg/ml of the anti-penton:TNFa 

fusion protein. The monolayers were visually inspected over time for vector 

spread as indicated by the extent of cytopathic effect (CPE). The percentage of 

CPE at each time point is shown. The use of this bifunctional reagent clearly 

enhances the spread of the Av3nBgFKO! vector throughout the monolayer. 

25 TABLE 5 

Efficient Scale-Up Of Detargeted Adenoviral 
Vectors Using Bifunctional Reagents: Anti-Penton:TNFa 





10 ppc - 
anti-penton 
TNF 


10 ppc 4- 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


100 ppc + 
anti-penton TNF 




Percentage of CPE 


AdBLud 


24 h 


0% 


0% 


0% 


0% 
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10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


100 ppc 4- 
anti-penton TNF 




Percentage of CPE 


48 h 


20-30% 


20-30% 


90-100% 


90-100% 


72 h 


60-70% 


80-90% 


100% 


100% 


120 h 


100% 


100% 


100% 


100% 


AvSnBgKOI 24hrs 


24 h 


0% 


0% 


0% 


0% 


48 h 


0% 


10-20% 


0% 


90-100% 


72 h 


5% 


60-70% 


5% 


100% 


120 h 


40-50% 


100% 


100% 


100% 



10 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A modified adenovirus capsid protein, 

the unmodified capsid protein binds to heparin sulfate 
proteoglycan (HSP); and 
5 the capsid protein comprises a mutation, whereby binding to 

heparin sulfate proteoglycan (HSP) is altered. 

2. The modified protein of claim 1 that is a fiber protein 

3. The capsid protein of claim 2, wherein the binding of the modified 
fiber protein is eliminated or reduced compared to the unmodified protein. 

10 4. The modified protein of claim 2, wherein the binding of the 

modified fiber protein is eliminated or reduced compared to the unmodified 
protein. 

5. The modified protein of claim 3 that comprises an insertion, 
deletion or replacement of amino acids. 
15 6. The modified protein of claim 2, wherein the mutation alters the 

motif that binds to HSP, whereby HSP interaction is altered. 

7. The modified protein of claim 6, motif is BBXB or BBBXXB, 
wherein the B is a basic amino acid and X is any amino acid. 

8. The modified protein of claim 7, wherein the motif comprises the 
20 consensus sequence KKTK. 

9. The modified protein of claim 2, wherein the fiber is a modified 
Ad5 or Ad2 fiber. 

10. A modified protein of claim 2 that is a chimeric fiber protein, 
comprising portions of fiber proteins from at least two different adenoviruses, 

25 wherein: 

a shaft or portion thereof is from a first adenovirus, whereby the resulting 
fiber does not bind to HSP or binds to HSP with reduced affinity compared to an 
unmodified fiber protein; 

a shaft or portion thereof from the first adenovirus does not bind 
30 to HSP or binds to HSP with reduced affinity compared to the second 
adenovirus; 

the second adenovirus binds to HSP; and 
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the portion comprises a sufficient portion to alter HSP binding of 
the resulting protein. 

1 1 . The modified protein of claim 10, wherein the binding to HSP of 
the modified fiber protein is eliminated or reduced compared to the unmodified 

5 protein. 

12. The modified protein of claim 10, wherein the remainder of the fiber 
protein is from the second adenovirus. 

13. The modified protein of any of claims 2, 3, 10 and 1 1, further 
comprising one or more further modifications that reduce or eliminate interaction 

10 of the resulting fiber with one or more cell surface proteins in addition to HSP. 

14. The modified protein of claim 13, further comprising a ligand, 
whereby the resulting fiber binds to a receptor for the ligand. 

15. The modified protein of claim 14, wherein the ligand is included in 
the knob region. 

15 16. The modified protein of claim 14, wherein the ligand is inserted or 

it replaces a portion of the fiber, whereby the resulting fiber binds to a receptor 
for the ligand. 

17. A modified protein of claim 1 1 , wherein affinity for HSP is reduced 
at least by an amount selected from among reduced 5-fold, 10-fold and 100- 

20 fold. 

18. The modified protein of claim 1 1, wherein the first adenovirus is 
selected from the group consisting of subgroup B, D or F, and the second is of 
subgroup C. 

19. The modified protein of claim 10, wherein the first adenovirus is 
25 selected from the group consisting of Ad3, Ad35, Ad7, Ad1 1, Ad16, Ad21, 

Ad34, Ad40, Ad41 and Ad46. 

20. The modified protein of claim 18, wherein the second adenovirus 
is Ad5 or Ad2. 

21 . The modified protein of claim 19, wherein the second adenovirus 
30 is Ad5 or Ad2. 

22. A modified protein of claim 1 selected from the group consisting of 
a fiber protein comprising: 
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the sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 
58, 62, 66, 70 and 72; or 

a sequence of amino acids having 90% sequence identity with a 
sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
5 70 and 72; or 

a sequence of amino acids encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency along at least 70% of its length to 
a sequence of nucleotides that encodes a sequence of amino acids set forth in 
any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 70 and 72. 
10 23. A nucleic acid molecule encoding a modified protein of any of 

claims 1-1 2 and 14-22. 

24. A nucleic acid molecule encoding a modified protein of claim 13. 

25. The nucleic acid molecule of claim 23 that comprises a vector. 

26. The nucleic acid molecule of claim 24 that comprises vector. 

15 27. The nucleic acid molecule of claim 25 that is an adenovirus vector. 

28. The nucleic acid molecule of claim 26 that is an adenovirus vector. 

29. The vector of claim 27 that is an adenoviral vector from a 
subgroup B, C or D adenovirus. 

30. The vector, of claim 28 that is an adenoviral vector from a 
20 subgroup B, C or D adenovirus. 



31. 



A cell, comprising a nucleic acid molecule of claim 23. 
A cell, comprising a nucleic acid molecule of claim 24. 
The cell of claim 31 that is a eukaryotic cell. 



32. 



33. 



34. 



The cell of claim 32 that is a eukaryotic cell. 

A cell, comprising a nucleic acid molecule of claim 27, wherein: 

the cell is a eukaryotic cell; and 

the cell in a packaging cell. 



25 



35. 



36. 



A cell, comprising a nucleic acid molecule of claim 28, wherein: 
the cell is a eukaryotic cell; and 
the cell in a packaging cell. 



30 
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37. An adenoviral particle, comprising a modified protein of any of 
claims 1-12 and 14-22, whereby binding of the viral particle to HSP is altered 
compared to a particle that expresses an unmodified fiber. 

38. An adenoviral particle, comprising a modified protein of claim 13, 

5 whereby binding of the viral particle to HSP is altered compared to a particle that 
expresses an unmodified fiber. 

39. An adenoviral particle of claim 37, wherein a native receptor for 
the fiber is coxsackie-adenovirus receptor (CAR). 

40. The adenoviral particle of claim 39, further comprising a mutation 
10 in the CAR-binding region of the capsid. 

41 . The adenoviral particle of claim 39, further comprising a mutation 
in the a v integrin-binding region of the capsid, whereby binding to the integrin is 
eliminated or reduced. 

42. The adenoviral particle of claim 40, further comprising a mutation 
15 in the a v integrin-binding region of the capsid, whereby binding to the integrin is 

eliminated or reduced 

43. The adenoviral particle of claim 39, wherein the CAR-binding 
region of the capsid modified is on a fiber knob. 

44. The adenoviral particle of claim 43, wherein the fiber knob 
20 modification is in the AB loop or CD loop. 

45. The adenoviral particle of claim 44, wherein the fiber knob 
modification is selected from the group consisting of KOI and K012. 

46. The adenoviral particle of claim 39, wherein the adenovirus is a 
subgroup C, D or F adenovirus. 

25 47. The adenoviral particle of claim 46, wherein the subgroup C virus 

is Ad2 or Ad5, the subgroup D virus is Ad46 and the subgroup F virus is Ad41 . 

48. The adenoviral vector of claim 27 that is an early generation 
adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 

30 49. The adenoviral vector of claim 28 that is an early generation 

adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 



WO 2003/062400 



PCT/US2003/002295 



-94- 

50. The adenoviral vector of claim 48, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

51. The adenoviral vector of claim 49, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

52. The adenoviral vector of claim 27 that comprises heterologous 
nucleic acid. 

53. The adenoviral vector of claim 28 that comprises heterologous 
nucleic acid. 

54. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid encodes a polypeptide. 

55. The adenoviral vector of claim 53, wherein the heterologous 
nucleic acid encodes a polypeptide. 

56. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a regulatory nucleic acid. 

57. The adenoviral vector of claim 53, wherein the heterologous 
nucleic acid comprises or encodes a regulatory nucleic acid. 

58. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a promoter or RNA. 

59. The adenoviral vector of claim 53, wherein the heterologous 
nucleic acid comprises or encodes a promoter or RNA. 

60. The adenoviral vector of claim 59, wherein the promoter is a cell 
or tissue specific promoter. 

61 . The adenoviral vector of claim 59, wherein the promoter is 
operably linked to a gene of an adenovirus essential for replication. 

62. The adenoviral vector of claim 60, wherein the tissue specific 
promoter is a tumor specific promoter. 

63. The adenoviral vector of claim 58, wherein the polypeptide is a 
therapeutic polypeptide. 

64. A method of expressing heterologous nucleic acid in a cell, 
comprising transducing the cell with an adenoviral vector of claim 57. 

65. The method of claim 64, wherein: 
the cell is a tumor cell; 
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the adenoviral vector is an oncolytic vector; and 
the cell is killed . 

66. The method of claim 64, wherein the cell is a mammalian cell. 

67. The method of claim 64, wherein the cell is a primate cell. 
5 68. The method of claim 67, wherein the cell is a human cell. 

69. A method of reducing transduction of liver cells by an adenoviral 
particle, comprising reducing or eliminating binding of the particle to heparin 
sulfate proteoglycans (HSPs) on the liver cells. 

70. A scale up method for the propagation of a detargeted adenoviral 
10 particle, comprising: 

infecting a cell capable of replicating, maturing and packaging an 
adenoviral vector with a detargeted adenoviral vector in the presence of a 
reagent that results in entry of the adenoviral particle into the cell; 

culturing the infected cell under conditions suitable for growth, spread 
15 and propagation of the adenoviral vector; and 

recovering the resulting adenoviral particles. 

71 . The method of claim 70, wherein the reagent is a polycation. 

72. The method of claim 71, wherein the polycation is selected from 
the group consisting of hexadimethrine bromide, polyethylenimine, protamine 

20 sulfate and poly-L-lysine. 

73. The method of claim 70, wherein the reagent is a bifunctional 
protein that binds to the adenoviral particle and to a receptor on the cell. 

- 74. The method of claim 73, wherein: 
the bifunctional protein is selected from the group consisting of an 
25 anti-fiber antibody ligand fusion, an anti-fiber-Fab-FGF conjugate, an 

anti-penton-antibody ligand fusion, an anti-hexon antibody ligand fusion and a 
poiylysine-peptide fusion, wherein the ligand is a ligand that binds to the 
receptor. 

75. The method of any one of claims 70-74, wherein the detargeted 
30 adenoviral particle expresses a modified capsid, whereby binding to at least one 
host cell receptor is reduced or eliminated compared with a wild-type 
adenovirus. 
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76. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with one host cell receptor. 

77. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with two host cell receptors. 

5 78. The method of claim 75, wherein the adenoviral particle is 

modified to eliminate or reduce binding with three host cell receptors. 

79. The, method of claim 75, wherein the particle is modified with one 
or more mutations selected from the group consisting of mutations that reduce 
or eliminate interactions with one or more of cr v integrins, coxsackie-adenovirus 

10 receptors (CAR), and heparin sulfate proteoglycans (HSP). 

80. The method of claim 79, wherein the mutation is selected from the 
group consisting of PD1, KOI, K012 and S*. 

81. The modified protein of claim 2, wherein the mutation is in the 
shaft of a fiber. 

15 82. A modified protein of claim 3, wherein affinity for HSP is reduced 

at least by an amount selected from among reduced 5-fold, 10-fold and 100- 
fold. 
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FIG. 11 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



15/35 




FIG. 12 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



16/35 



120 




2000 4000 6000 8000 
Particles per Cell 

FIG. 13A 



10000 



-♦— WT 
-»_S* 

-*-S*PD1 



-^-K01S* 
-B — K01S*PD1 



c 
o 



80 -, 
70 



o ~ 60 

3 TJ 

" 50 



T5 

c 



- c 40 
4-j S 30 

g -5. 20 
| 10 
0 



5 



<3 



3 



Vector 

FIG. 13B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



17/35 



□ No competitor 
■ Fiber 
^RGD 
ED Heparin 



120 

c 

•S 100 

O — s 

■5 "3 so 

to ^ 

St 60 



r— ro 

i- CD 
Q- 



40 
20 
0 



1 




3 



n 




I 1 



»V> <P 

4? <f> <f 



Vector 



FIG. 13C 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT/US2003/002295 



18/35 



> 

< a> 

<n Si 

2 a) 

5 3 

is * 

9 

CO. 



7000 -I 
6000- 
5000- 
4000 - 
3000- 
2000- 
1000- 
0 



HBSS 



wt 



K01 



PD1 K01PD1 S* KOIS* PD1S* . KQ1S*PD1 



FIG. 14A 



o 

o 

CL 
<D + 

"5L c 

O CO 

O <£ 

c j= 
o 

X 

o 



100 
90 
80 
70 
60 
50 
40 
30 
20 
10 



ill 



L o N „o - ><? 



FIG. 14B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



19/35 



BamHI . 
Clal :j 
Csp45l | 

BstBl ; 
EcoRI il 




FIG. 15A 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



20/35 



Muni 
BsrGI. 



sm.. 

EcoRI.. 
Spel 



BamHI 
Clal. 
Csp45l 
BstBI 
EcoRI 



Ndel. 



.Muni 
•'.BsrGI 




\ Pmel 
Notl 



BsrGI 

BamHI i 
Sgfl 



AscI 
Notl 
Notl 



Ndel 
BsrGI 



FIG. 15B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



21/35 




AvInBg S* S*RGD S*KQ1 S*KQ1RGD 



FIG. 16 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



22/35 



BamHI 
Clal, 




FIG. 17A 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



23/35 



32152,C/al. 
32144,BsfBlj _Mun 1,642 




18155,Sgfl' 



FIG. 17B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



2^/35 




FIG. 18A 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



25/35 



32923, C/a I. 
32915,BsfBlj Mun 1,642 




18155,Sgfl' 



FIG. 18B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT/US2003/002295 



26/35 



♦ 


-5F 


— m- 


-35F 


— 7k~ 


-5T35H 


— *- 


-35T5H 



100 -, 




1200 



Particles Per Cell 



FIG. 19 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



27/35 



PCT7US2003/002295 



> 

< o 

0) "o 

(0 t. 

(0 Q. 

IS o) 

CO 5=* 

© 
i 



4000 
3500 
3000 
2500 
2000 
1500 
1000 
500 
0 




- - 



HBSS AvInBg 35F 5T35H 35T5H 

Treatment Groups 



FIG. 20 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



28/35 



PCT7US2003/002295 



Clal. 
BstBlj .Muni 




Sgfl' 



FIG. 21 A 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



29/35 



Clal . 
BstBI [ 




Sgfl 



FIG. 21 B 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT/US2003/002295 



30/35 



1 




8 8 3 8 8 £ ° 



LL. 
C/D 



co- 
urt 



to 

a si 

CO. 3 



5 <5s 




CM 

cm; 
| g 

In U. 



lieQ J9d sefdoQ jejiAouepv 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



31/35 





(PS -/+ UBOUl) 

uoijonpsuBjj. ^ueoaed 



o 
o 

o. 
o 



o 
o 
o 

00 _ 



o 
o 
o 

CD 



o 
o 
o 



o 
o 
o 

CN 



CD 

O 

o 

CL 

U) 
_0> 

O 

mwm 

CO 



CO 
CM 

O 
LL 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



32 /35 



H 



H 



O 
Q- 
Q. 



O 

© ^ 
O li- 
ra S 5 

E > 
< < 



h ; 



r - 
O 



o 

CD 



O 

in 



o 



o 

CO 



o 



I 

o 



(PS + UB8LU) 

uoijonpsuBJi iueojQd 



CL 



* © 

CM 



CL =L 



60 



CD 



G) 



3 C 
.i= O 
> W 



© CM 

E . 
to O 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 




SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT/US2003/002295 



34/35 




* 
* 

* i 
* 

* 
* 



LQdUSlOX 



K3d*S 



*S1C»I 




CD 

o 

LL 



o 
o 

CM 



o 



O 

o 



o 



o 



SUBSTITUTE SHEET (RULE 26) 



WO 2003/062400 



PCT7US2003/002295 



35/35 



O 
E 

I— 



CO 

O 



* 

CO 



d 



CO 

d 



CM T- 

d d 



H^O J^d S9ldO0 UOX3H 



Q. 

o 

■2 a. 

x 8 

o 

c ■= O 
o o rr 
O Z o 



1 



UL. 



* 

CO 

I 



CD 
> 



O 



lo o to o io o 

CO CO CN CM r- t- 



||O0 i^d seidoo uoxaH 

SUBSTITUTE SHEET (RULE 26) 



GO 

CM 

o 



< 

CM 

O 
U. 



WO 2003/062400 



PCT/US2003/002295 



-1- 



SEQUENCE LISTING 
<110> The Scripps Research Institute 
Novartis AG 
Kaleko, Michael 
Nemerow, Glen R. 
Smith, Theodore 
Stevenson, Susan C. 

<12 0> Fiber Shaft Modifications for Efficient Targeting 

<13 0> 22908 -123 6PC 

<14 0> Not yet assigned 
<141> Herewith 

<150> 60/350,388 
<151> 2002-01-24 

<150> 60/391,967 
<151> 2002-06-26 

<160> 72 

<170> Patentln version 3.0 

<210> 1 

<211> 4 

<212> PRT 

<213> adenovirus serotype 5 

<400> 1 

Lys Lys Thr Lys 

1 

<210> 2 
<211> 1746 
<212> DNA 

<213> adenovirus serotype 5 
<400> 2 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 

accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 

gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 

atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 

caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 3 00 

atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 3 60 

atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 42 0 

aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 48 0 

acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 

ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 60 0 

ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 

accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 72 0 

ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 

attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 84 0 

caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 90 0 

attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 96 0 

gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 102 0 

ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 10 8 0 

attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 

cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 120 0 

ttgtggacca caccagctcc agaggctaac tgtagactaa atgcagagaa agatgctaaa 126 0 
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ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



<210> 3 

<211> 1746 

<212> DNA 

<213> adenovirus serotype 5 



<400> 3 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggag 

cacaactaca 

gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgttcactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atggaggcgg 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



<210> 4 

<211> 1737 

<212> DNA 

<213> adenovirus serotype 5 



<400> 4 

atgcggcgcg 

gcggcgccag 

gtgcctccgc 

gcacccctat 

tccctgaact 

tacagcccgg 

ggcgacctga 

aataagttta 



cggcgatgta 
tggcggcggc 
ggtacctgcg 
tcgacaccac 
accagaacga 

gggaggcaag 

aaaccatcct 
aggcgcgggt 



tgaggaaggt 
gctgggttct 
gcctaccggg 
ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 



cctcctccct 
cccttcgatg 
gggagaaaca 
ctggtggaca 
tttctgacca 
atcaatcttg 
atgccaaatg 
cgcttgccta 



cctacgagag 
ctcccctgga 
gcatccgtta 
acaagtcaac 
cggtcattca 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 



tgtggtgagc 
cccgccgttt 
ctctgagttg 
ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 



60 
120 
180 
240 
300 
360 
420 
480 
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ctgaaatacg 
atagacctta 
gttctggaaa 
cccgtcactg 
attttgctgc 
atccgcaagc. 
ggtaacattc 
gaacagggcg 
tccaacgcgg 
ccctacgacg 
gaggccgaag 
cagaagaaac 
ataagcaatg 
gaccctcaga 
tcggagcagg 
acgcgccaga 
agcttctaca 
cacgtgttca 
accaccgtca 
agcatcggag 
gtttacaagg 



agtgggtgga 
tgaacaacgc 
gcgacatcgg 
gtcttgtcat 
caggatgcgg 
ggcaaccctt 
ccgcactgtt 

ggggtggcgc 

cagccgcggc 
tgcccgacta 
cagcggccga 
cggtgatcaa 
acagcacctt 
ccggaatccg 
tctactggtc 
tcagcaactt 
acgaccaggc 
atcgctttcc 
gtgaaaacgt 
gagtccagcg 
ccctgggcat 



gttcacgctg 
gatcgtggag 
ggtaaagttt 
gcctggggta 
ggtggacttc 
ccaggagggc 
ggatgtggac 
aggcggcagc 
aatgcagccg 
cgcgggcacc 
agctgccgcc 
acccctgaca 
cacccagtac 
ctcatggacc 
gttgccagac 
tccggtggtg 
cgtctactcc 
cgagaaccag 
tcctgctctc 
agtgaccatt 
agtctcgccg 



cccgagggca 
cactacttga 
gacacccgca 
tatacaaacg 
acccacagcc 
tttaggatca 
gcctaccagg 
aacagcagtg 
gtggaggaca 
agcgccacac 
cccgctgcgc 
gaggacagca 
cgcagctggt 
ctgctttgca 
atgatgcaag 
ggcgccgagc 
caactcatcc 
attttggcgc 
acagatcacg 
actgacgcca 
cgcgtcctat 



actactccga 
aagtgggcag 
acttcagact 
aagccttcca 
gcctgagcaa 
cctacgatga 
cgagcttgaa 
gcagcggcgc 
tgaacgatag 
gggctgagga 
aacccgaggt 
agaaacgcag 
accttgcata 
ctcctgacgt 
accccgtgac 
tgttgcccgt 
gccagtttac 
gcccgccagc 
ggacgctacc 
gacgccgcac 
cgagccgcac 



gaccatgacc 
acagaacggg 

ggggtttgac 

tccagacatc 
cttgttgggc 
tctggagggt 
agatgacacc 
ggaagagaac 
ccgcggctac 
gaagcgcgct 
cgagaagcct 
ttacaaccta 
caactacggc 
aacctgcggc 
cttccgctcc 
gcactccaag 
ctctctgacc 
ccccaccatc 
gctgcgcaac 
ctgcccctac 
tttttga 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1737 



<210> 5 

<211> 20 

<212> DNA 

<213> adenovirus serotype 5 



<400> 5 

gaacaggagg tgagcttaga 



20 



<210> 6 

<211> 43 

<212> DNA 

<2 13 > adenovirus serotype 5 



<400> 6 

tccgcctcca tttagtgaac agttaggaga tggagctggt gtg 



43 



<210> 7 

<211> 44 

<212> DNA 

<213> adenovirus serotype 5 



<400> 7 

tcactaaatg gaggcggaga tgctaaactc actttggtct taac 



44 



<210> 8 

<211> 20 

<212> DNA 

<213 > adenovirus serotype 5 



<400> 8 

gtggcaggtt gaatactagg 



20 



<210> 9 

<211> 8 

<212> PRT 

<213> adenovirus serotype 5 



<400> 9 

His Ala lie Arg Gly Asp Thr Phe 
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<210> 10 

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> mofified sequence for penton protein 

<400> 10 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser 
1 5 10 15 

<210> 11 

<211> 57 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 11 

cgcggaagag aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaa 57 

<210> 12 

<211> 59 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 12 

tatcgttcat gtcctccacc ggctgcattg ccgcggctgc cgcgttggag ttctcttcc 59 

<210> 13 

<211> 75 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 13 

cgatagccgc ggctacccct acgacgtgcc cgactacgcg ggcaccagcg ccacacgggc 6 0 

tgaggagaag cgcgc 75 

<210> 14 

<211> 73 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 14 

tcagcgcgct tctcctcagc ccgtgtggcg ctggtgcccg cgtagtcggg cacgtcgtag 60 
gggtagccgc ggc ~ 73 

<210> 15 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 
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<400> 15 

ggctccggct ccgagaggtg ggctcacagt ggttacattt 4 0 

<210> 16 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> oligonucleotide for mutation generation 

<400> 16 

ggagccggag cctcaaacat aaacctggaa at 32 

<210> 17 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 17 

ctctagaaat ggacggaatt attacag 27 

<210> 18 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 18 

tcttggtcat ctgcaacaac atgaagatag tg 3 2 

<210> 19 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 19 

gttgttgcag atgaccaaga gagtccggct ca 32 

<210> 20 

<211> 73 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 20 

agcaattgaa aaataaacac gttgaaacat aacacaaacg attctttagt tgtcgtcttc 60 
tgtaatgtaa gaa 73 

<210> 21 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 21 
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agcaattgaa aaataaacac gttg 



24 



<210> 22 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 22 

gaacaggagg tgagcttaga 2 0 

<210> 23 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<210> 24 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 24 

gataagctaa ctttgtggac cggaataaac cctccaccta ac 42 

<210> 25 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<210> 26 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 26 

gttaggagat ggagctggtg tagtccataa ggtgttaata c 41 

<210> 27 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 23 

gttaggtgga gggtttattc cggtccacaa agttagctta tc 



42 



<400> 25 

gtggcaggtt gaatactagg 



20 



<400> 27 

gtattaacac cttatggact acaccagctc catctcctaa c 



41 
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<210> 28 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 28 

tgcgcaaaaa caatcaccac gacaatcaca atgtacattg gaagaaatca tacg 54 

<210> 29 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 29 

acattgtgat tgtcgtggtg attgtttttg cgcatatgcc atacaatttg aatg 54 

<210> 30 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> RGD targeting peptide 
<400> 30 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys 
15 10 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 31 

ttcttttcat ctgcaacaac atgaagatag tg 32 

<210> 32 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 32 

gttgttgcag atgaaaagaa ccagaattga ag 32 

<210> 33 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 
<400> 33 

tgcaattgaa aaataaacac gttgaaacat aacacaaacg attctttatt cttcagttat 60 
gtagcaaaat aca 73 
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<210> 34 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 34 

agtacaaaaa caatcaccac gacaatcaca gtttatctcg ttgtagacga cactga 5 6 

<210> 35 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 35 

tgtgattgtc gtggtgattg tttttgtact agtgggtatg cttttacttt t 51 

<210> 36 

<211> 4 

<212> PRT 

<213> Adenovirus type 5 

<400> 36 

Thr Leu Trp Thr 

1 

<210> 37 

<211> 7 

<212> PRT 

<213> SV40 

<400> 37 

Pro Lys Lys Lys Arg Lys Val 
1 5 

<210> 38 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 38 

cttcgatgat gccgcagtg 19 

<210> 39 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 39 

gggctcaggt actccgagg 19 

<210> 40 

<211> 25 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<2 23> amplification primer 
<400> 40 

ttacatgcac atctcgggcc aggac 2 5 

<210> 41 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid GRE5-El-SV4 0-Hygro 
<400> 41 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 12 0 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 18 0 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 3 60 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 42 0 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 60 0 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 72 0 
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 84 0 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 90 0 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 96 0 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 102 0 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 10 80 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 1140 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 120 0 ~- 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 132 0 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 15 0 0 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 162 0 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 192 0 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2 040 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 2100 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 222 0 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 234 0 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 24 0 0 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
9gggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 252 0 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2 64 0 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 270 0 
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gcattttgac aattacagag gatgggcagg 
cttgtgaggc tacagaggag gctaggaatc 
ctgagtgtat tacttttcaa cagatcaagg 
cgcagaagta ttccatagag cagctgacca 
aggaggctat tagggtatat gcaaaggtgg 
gcaaacttgt aaatatcagg aattgttgct 
tagatacgga ggatagggtg gcctttagat 
ttggcatgga cggggtggtt attatgaatg 
cggttttcct ggccaatacc aaccttatcc 
atacctgtgt ggaagcctgg accgatgtaa 
ggaagggggt ggtgtgtcgc cccaaaagca 
ggtgtacctt gggtatcctg tctgagggta 
actgtggttg cttcatgcta gtgaaaagcg 
gcaactgcga ggacagggcc tctcagatgc 
tgaagaccat tcacgtagcc agccactctc 
tactgacccg ctgttccttg catttgggta 
gcaatttgag tcacactaag atattgcttg 
acggggtgtt tgacatgacc atgaagatct 
ccaggtgcag accctgcgag tgtggcggta 
atgtgaccga ggagctgagg cccgatcact 
gctctagcga tgaagataca gattgaggta 
gaaagaatat ataaggtggg ggtcttatgt 
ccgccatgag caccaactcg tttgatggaa 
tgcccccatg ggccggggtg cgtcagaatg 
tcctgcccgc aaactctact accttgacct 
ctgcagcctc cgccgccgct tcagccgctg 
ttgctttcct gagcccgctt gcaagcagtg 
agttgacggc tcttttggca caattggatt 
agcagctgtt ggatctgcgc cagcaggttt 
cggtttaaaa cataaataaa aaaccagact 
gctgtctcag ctgactgctt aagtcgcaag 
ttaaagcaga acttgtttat tgcagcttat 
aatttcacaa ataaagcatt tttttcactg 
aatgtatctt atcatgtctg gtcgactcta 
ctgcgctcgg tcgttcggct gcggcgagcg 
ttatccacag aatcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 
gagcatcaca aaaatcgacg ctcaagtcag 
taccaggcgt ttccccctgg aagctccctc 
accggatacc tgtccgcctt tctcccttcg 
tgtaggtatc tcagttcggt gtaggtcgtt 
cccgttcagc ccgaccgctg cgccttatcc 
agacacgact tatcgccact ggcagcagcc 
gtaggcggtg ctacagagtt cttgaagtgg 
gtatttggta tctgcgctct gctgaagcca 
tgatccggca aacaaaccac cgctggtagc 
acgcgcagaa aaaaaggatc tcaagaagat 
cagtggaacg aaaactcacg ttaagggatt 
acctagatcc ttttaaatta aaaatgaagt 
acttggtctg acagttacca atgcttaatc 
tttcgttcat ccatagttgc ctgactcccc 
ttaccatctg gccccagtgc tgcaatgata 
ttatcagcaa taaaccagcc agccggaagg 
tccgcctcca tccagtctat taattgttgc 
aatagtttgc gcaacgttgt tgccattgct 
ggtatggctt cattcagctc cggttcccaa 
ttgtgcaaaa aagcggttag ctccttcggt 
gcagtgttat cactcatggt tatggcagca 
gtaagatgct tttctgtgac tggtgagtab 
cggcgaccga gttgctcttg cccggcgtca 
actttaaaag tgctcatcat tggaaaacgt 



ggctaaaggg ggtaaagagg gagcgggggg 2 76 0 
tagcttttag cttaatgacc agacaccgtc 2 82 0 
ataattgcgc taatgagctt gatctgctgg 2 88 0 
cttactggct gcagccaggg gatgattttg 2 94 0 
cacttaggcc agattgcaag tacaagatca 3 0 00 
acatttctgg gaacggggcc gaggtggaga 3 06 0 
gtagcatgat aaatatgtgg ccgggggtgc 312 0 
taaggtttac tggccccaat tttagcggta 318 0 
tacacggtgt aagcttctat gggtttaaca 3240 
gggttcgggg ctgtgccttt tactgctgct 33 0 0 
gggcttcaat taagaaatgc ctctttgaaa 3360 
actccagggt gcgccacaat gtggcctccg 342 0 
tggctgtgat taagcataac atggtatgtg 34 8 0 
tgacctgctc ggacggcaac tgtcacctgc 3 54 0 
gcaaggcctg gccagtgttt gagcataaca 3 60 0 
acaggagggg ggtgttccta ccttaccaat 3 66 0 
agcccgagag catgtccaag gtgaacctga 3 72 0 
ggaaggtgct gaggtacgat gagacccgca 3 78 0 
aacatattag gaaccagcct gtgatgctgg 3 84 0 
tggtgctggc ctgcacccgc gctgagtttg 3 90 0 
ctgaaatgtg tgggcgtggc ttaagggtgg 3 960 
agttttgtat ctgttttgca gcagccgccg 4 02 0 
gcattgtgag ctcatatttg acaacgcgca 4 08 0 
tgatgggctc cagcattgat ggtcgccccg 414 0 
acgagaccgt gtctggaacg ccgttggaga 42 0 0 
cagccaccgc ccgcgggatt gtgactgact 4260 
cagcttcccg ttcatccgcc cgcgatgaca 432 0 
ctttgacccg ggaacttaat gtcgtttctc 43 80 
ctgccctgaa ggcttcctcc cctcccaatg 4440 
ctgtttggat ttggatcaag caagtgtctt 45 0 0 
ccgaattgga tccaattcgg atcgatctta 4560 
aatggttaca aataaagcaa tagcatcaca 4 62 0 
cattctagtt gtggtttgtc caaactcatc 4680 
gactcttccg cttcctcgct cactgactcg 4740 
gtatcagctc actcaaaggc ggtaatacgg 4 80 0 
aagaacatgt gagcaaaagg ccagcaaaag 4 860 
gcgtttttcc ataggctccg cccccctgac 4920 
aggtggcgaa acccgacagg actataaaga 498 0 
gtgcgctctc ctgttccgac cctgccgctt 5040 
ggaagcgtgg cgctttctca tagctcacgc 510 0 
cgctccaagc tgggctgtgt gcacgaaccc 5160 
ggtaactatc gtcttgagtc caacccggta 5220 
actggtaaca ggattagcag agcgaggtat 52 8 0 
tggcctaact acggctacac tagaaggaca 534 0 
gttaccttcg gaaaaagagt tggtagctct 5400 
ggtggttttt ttgtttgcaa gcagcagatt 54 6 0 
cctttgatct tttctacggg gtctgacgct 552 0 
ttggtcatga gattatcaaa aaggatcttc 55 8 0 
tttaaatcaa tctaaagtat atatgagtaa 564 0 
agtgaggcac ctatctcagc gatctgtcta 570 0 
gtcgtgtaga taactacgat acgggagggc 5760 
ccgcgagacc cacgctcacc ggctccagat 5 82 0 
gccgagcgca gaagtggtcc tgcaacttta 58 8 0 
cgggaagcta gagtaagtag ttcgccagtt 594 0 
acaggcatcg tggtgtcacg ctcgtcgttt 60 0 0 
cgatcaaggc gagttacatg atcccccatg 606 0 
cctccgatcg ttgtcagaag taagttggcc 612 0 
ctgcataatt ctcttactgt catgccatcc 6180 
tcaaccaagt cattctgaga atagtgtatg 624 0 
atacgggata ataccgcgcc acatagcaga 63 00 
tcttcggggc gaaaactctc aaggatctta 6360 
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ccgctgttga gatccagttc gatgtaaccc 
tttactttca ccagcgtttc tgggtgagca 
ggaataaggg cgacacggaa atgttgaata 
agcatttatc agggttattg tctcatgagc 
aaacaaatag gggttccgcg cacatttccc 
attattatca tgacattaac ctataaaaat 
gcgtttcggt gatgacggtg aaaacctctg 
ttgtctgtaa gcggatgccg ggagcagaca 
cgggtgtcgg ggctggctta actatgcggc 
tatgcggtgt gaaataccgc acagatgcgt 
agcgttaata ttttgttaaa attcgcgtta 
caataggccg aaatcggcaa aatcccttat 
agtgttgttc cagtttggaa caagagtcca 
gggcgaaaaa ccgtctatca gggcgatggc 
tttttggggt cgaggtgccg taaagcacta 
agagcttgac ggggaaagcc ggcgaacgtg 
gcgggcgcta gggcgctggc aagtgtagcg 
gcgcttaatg cgccgctaca gggcgcgtcc 
gaagggcgat cggtgcgggc ctcttcgcta 
gcaaggcgat taagttgggt aacgccaggg 
gccagtgaat tgtaatacga ctcactatag 

<210> 42 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid MMTV-E2a-SV4 0-Neo 



actcgtgcac ccaactgatc ttcagcatct 642 0 
aaaacaggaa ggcaaaatgc cgcaaaaaag 64 8 0 
ctcatactct tcctttttca atattattga 6540 
ggatacatat ttgaatgtat ttagaaaaat 66 00 
cgaaaagtgc cacctgacgt ctaagaaacc 6660 
aggcgtatca cgaggcccct ttcgtctcgc 672 0 
acacatgcag ctcccggaga cggtcacagc 6780 
agcccgtcag ggcgcgtcag cgggtgttgg 684 0 
atcagagcag attgtactga gagtgcacca 690 0 
aaggagaaaa taccgcatca ggaaattgta 6960 
aatttttgtt aaatcagctc attttttaac 7020 
aaatcaaaag aatagaccga gatagggttg 70 80 
ctattaaaga acgtggactc caacgtcaaa 7140 
ccactacgtg aaccatcacc ctaatcaagt 72 00 
aatcggaacc ctaaagggag cccccgattt 7260 
gcgagaaagg aagggaagaa agcgaaagga 732 0 
gtcacgctgc gcgtaaccac cacacccgcc 73 80 
cattcgccat tcaggctgcg caactgttgg 744 0 
ttacgccagc tggcgaaagg gggatgtgct 75 0 0 
ttttcccagt cacgacgttg taaaacgacg 7560 
ggcgaattaa ttcgggg 7607 



<400> 42 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 6 0 
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 12 0 
gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 
tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 
ggcatagctc tgctttgcct ggggctattg ggggaagttg cggttcgtgc tcgcagggct 30 0 
ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 
aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 42 0 
ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 480 
accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 
actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgctca 60 0 
acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga 660 
aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 72 0 
ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 78 0 
tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 
gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 90 0 
atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 9 60 
attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 102 0 
catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 10 8 0 
agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 114 0 
cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 120 0 
ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 12 6 0 
attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 132 0 
gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 13 8 0 
cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 144 0 
gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 
ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 156 0 
cgtccaagac cctcaaagat ttttggcact tcgttgagcg aggcgatatc aggtatgaca 162 0 
gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 168 0 
ggtatcttgc agttttggaa aaagatgtga taggtggcaa gcacctctgg cacggcaaat 174 0 
ac 9gggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 
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ggtacgcgcg 
ggcacatcgc 
ttcaacagca 
acttgttcct 
gagcggtcct 
tcctcaagcg 
gcggtggtgg 
atctttttct 
acccccgagc 
ccgtccccgt 
gagtccgagg 
cggccatcga 
tctcccaagc 
gaagatgtgg 
aaaggaggta 
cggacgcaag 
ccgctgagtg 
atggacaagt 
gtggaagctc 
ctgaccttca 
ctgcagtcgt 
ctgcaccgct 
aataaggagc 
gage agt eta 
tccaacaccg 
ggcaagtctt 
atcaaggctt 
atgccactac 
ctaccaaagt 
tccgacaaga 
aaccctgtgt 
tcggcgcccg 
accgagctgc 
aacgtgtccc 
gegcagaegg 
cctttattga 
gtggcgctcc 
ctgtagagct 
gagttgggta 
cagttggcaa 
ccctgtacgc 
gcgctacggc 
gaaaagccaa 
atccagtgta 
tgtggagaaa 
ccaagatctt 
gttgatgttg 
acaeggttte 
acaaaacata 
ttacctcagg 
aattggacaa 
tataatgtgt 
ctgatgaatg 
aaatgecate 
agaagagaaa 
atgctgtgtt 
ctgcactgct 
acagttataa 
ttaataacta 
aggaatattt 
gaggttttac 



gtgagaatag 
tgcgctcttg 
cgtcgtctcc 
cgtttgcctc 
cgtcgtcttc 
ggggtgcctc 
cgaactcaga 
gectatagga 
geggacgegg 
cgccgccgcc 
acgaggaaga 
cctcggcggc 
ccgagcgccc 
cgctacaaat 
agegcacagt 
aggaagagga 
tgccgatcgt 
accacgtgga 
tggcggccgt 
ccagcaacaa 
ttgcagaggt 
gegctgagat 
acgtgattga 
geaaggecaa 
aegcaaggtg 
gcggcatgtt 
ttatgeagge 
ggtgcgagtg 
tgactccgtt 
gcgtgctggc 
atcgcaactc 
acctgetaaa 
cgcggatggt 
tgccagtggc 
caagggtggg 
aagtgtctct 
taatctgege 
gttcctggtt 
ccccggtaat 
aggcgtggag 
tttgggtgga 
gcaggagtga 
agggctcaaa 
cgcccccagt 
caaagectgg 
tgacaatggc 
ettgettett 
gatgacgecg 
aagaagggtg 
tcgacctcga 
actacctaca 
taaactactg 
ggagcagtgg 
tagtgatgat 
ggtagaagac 
tagtaataga 
atacaagaaa 
tcataacata 
tgctcaaaaa 
gatgtatagt 
ttgctttaaa 



gtggcgttcg 
caacgcgtcg 
cacatctagg 
tgcgttgtcc 
gcttacaaaa 
gaeggggaag 
gggggcggtt 
gaaggaaatg 
tgeggegega 
tccccgggcg 
ctcatcacaa 
ggatttggcc 
gccatcacca 

ggtgggtttc 
gcggcggctg 
agagcccagc 
gtctgcgtgg 
taacgatcta 
atgeaagace 
gacctttgtg 
gacctacaag 
egaaggegag 
aatggatgtg 
gatcgtgaag 
ctgcgtgcac 
cttctctgaa 
gctgtatcct 
caactcaaag 
cgccctgagc 
cagcgtgcac 
gcgcgcgcag 
cgcgttggtg 
tgtgcctgag 
geatagegat 
ggtaaataat 
agtacattat 
actgtggctg 
gegaegcagg 
aaggttcatg 
aaacatgcag 
cttttccagc 
ctcgtactca 
gaggtagcat 
ctcgcgaccg 
aaagcgcttg 
tttcagttcc 
tatgttgtgg 
cggtgcggcc 
ggctcgtcca 
gggatctttg 
gagatttaaa 
attctaattg 
tggaatgect 
gaggctactg 
cccaaggact 
actcttgett 
attatggaaa 
ctgttttttc 
ttgtgtacct 
gecttgacta 
aaacctccca 



taggcaaggc 
cagataatgg 
tagtcgecat 
tggtcttget 
cctgggtcct 
gtggtaggcg 
aggctgtcct 
gecagteggg 
cgtcccccaa 
cccccaaaaa 
gaegegctgg 
attgcgccca 
gaggtaatcg 
agcaacccac 
aatgaagacg 
gaagcggaaa 
gagaagggca 
aaggegaact 
tggctgaacg 
acgatgatgg 
catcacgagc 
cttaagtgtc 
aegagegaaa 
aaccggtggg 
gacgcggcct 
ggegcaaagg 
aacgcccaga 
cctgggcacg 
aacgeggagg 
cacccggcgc 
ggeggaggee 
atggtgcgca 
tttaagtgga 
gegeggcaga 
cacccgagag 
ttttacatgt 
eggaagtagg 
gtgggctgta 

gtggggttgt 

cagaatagtc 
gttatacagc 
aactggtaaa 
gtttttgagt 
geegtattga 
tcataggtgc 
tgctcactgg 
cgttgccggc 
ggtgcacacg 
tgggatccat 
tgaaggaacc 
gctctaaggt 
tttgtgtatt 
ttaatgagga 
ctgactctca 
ttccttcaga 
getttgetat 
aatattctgt 
ttactccaca 
ttagcttttt 
gagatcataa 
cacctccccc 



tgacatccgc 
cgcactggcg 
gcctttcgtc 
ttttatcctc 
gctcgataat 
cgttggcggc 
tcttctcgac 
aagaggagca 
ccatggagga 
ageggatgag 
tgccgcgcac 
agaagaaaaa 
tggacagega 
cggtgctaat 
acccagtggc 
gtgaaattac 
tggaggctgc 
tcaaactact 
aggagcaccg 
ggegattect 
ccacgggctg 
tacaeggaag 
aegggcageg 
gecgaaatgt 
gtccggccaa 
ctcaggtggc 
ccgggcacgg 
cgcccttttt 
acctggacgc 
tgatagtgtt 
ccaactgcga 
gcctgtggag 
gcactaaaca 
acccctttga 
tgtacaaata 
ttttcaagtg 
gcgagtggcg 
cctggggact 
gatccatggg 
cacaggcggc 
ggtcggggga 
ectgettgag 
gcgggttcca 
etatggegea 
ccaaaaaata 
agcccatggc 
cgagaagggc 
gaccacgtca 
atatagggee 
ttacttctgt 
aaatataaaa 
ttagattcca 
aaacctgttt 
acattctact 
attgetaagt 
ttacaccaca 
aacctttata 
caggcataga 
aatttgtaaa 
tcagccatac 
tgaacctgaa 



tatggegagg 
ctgcagatgc 
cccccgcccg 
tgttggtact 
cacttcctcc 
atcggtggag 
tgactccatg 
gcgcgaaacc 
cgtgtcgtcc 
geggegtate 
acccagcccg 
gaagcgccct 
ggaagaaaga 
caagcatggc 
gcgtggtatg 
ggtgatgaac 
gcgcgcgctg 
gcctgaccaa 
cgggttgcag 
geaggegtae 
cgcgttgtgg 
cattatgata 
cgcgctgaag 
ggtgcagatc 
tcagttttcc 
ttttaagcag 
tcaccttttg 
gggaaggcag 
ggatctgatc 
ccagtgctgc 
cttcaagata 
tgaaaacttc 
ccagtatcgc 
tttttaaacg 
aaagcatttg 
acaaaaagaa 
ctccaggaag 
gttgagcatg 
agtttggggc 
cgagttgggc 
agaagcaatg 
tegctggtea 
ggcaaaggee 
ggcgagcttg 
tggcccacaa 
ggcagctgtt 
gtgcgcaggt 
aagacttcaa 
cgggttataa 
ggtgtgacat 
tttttaagtg 
acctatggaa 
tgctcagaag 
cctccaaaaa 
tttttgagtc 
aaggaaaaag 
agtaggcata 
gtgtctgcta 
gggdjttaata 
cacatttgta 
acataaaatg 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260- 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
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aatgcaattg 
agcatcacaa 
aaactcatca 
tgtggaaagt 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
ggaagcggaa 
gctactgggc 
gtgggcttac 
tgccagctgg 
tcttgccgcc 
gatcgtttcg 
agaggctatt 
tccggctgtc 
tgaatgaact 
gcgcagctgt 
tgccggggca 
ctgatgcaat 
cgaaacatcg 
atctggacga 
gcatgcccga 
tggtggaaaa 
gctatcagga 
ctgaccgctt 
atcgccttct 
gacgcccaac 
cttcggaatc 
ggagttcttc 
gctggacgac 
cagcggctat 
tttggtcccg 
tacctacaga 
aactactgat 
agcagtggtg 
gtgatgatga 
tagaagaccc 
gtaatagaac 
acaagaaaat 
ataacatact 
ctcaaaaatt 
tgtatagtgc 
gctttaaaaa 
gttgttaact 
ttcacaaata 
gtatcttatc 
ctctacttga 
aattaggtca 
ggtaatttta 
cagcccacaa 
accctgctca 
attttcccca 
aacccagcac 
catctgctga 
cctgtagttt 
aaaatttgac 
atgcaagttt 
catcaaaata 
cgaaagggcc 



ttgttgttaa 
atttcacaaa 
atgtatctta 
ccccaggctc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctcggcctc 
gcaaaaagct 
cacgtagaaa 
tatctggaca 
atggcgatag 
ggcgccctct 
aaggatctga 
catgattgaa 
cggctatgac 
agcgcagggg 
gcaggacgag 
gctcgacgtt 
ggatctcctg 
gcggcggctg 
catcgagcga 
agagcatcag 
cggcgaggat 
tggccgcttt 
catagcgttg 
cctcgtgctt 
tgacgagttc 
ctgccatcac 
gttttccggg 
gcccaccccg 
ctcgcggagt 
ccgcgcatcc 
gatctttgtg 
gatttaaagc 
tctaattgtt 
gaatgccttt 
ggctactgct 
caaggacttt 
tcttgcttgc 
tatggaaaaa 
gttttttctt 
gtgtaccttt 
cttgactaga 
acctcccaca 
tgtttattgc 
aagcattttt 
atgtctggat 
gaggacattc 
cttaacaaaa 
aaatatctgg 
atgtcaacag 
tcaagaagca 
cctgtgtagg 
tccactggat 
ctgtcaactg 
gctaacacac 
ccttgaatgg 
aacatagcag 
tttccacagg 
tcgtgatacg 



cttgtttatt 
taaagcattt 
tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
tcacgctgcc 
gccagtccgc 
agggaaaacg 
ctagactggg 
ggtaaggttg 
tggcgcaggg 
caagatggat 
tgggcacaac 
cgcccggttc 
gcagcgcggc 
gtcactgaag 
tcatctcacc 
catacgcttg 
gcacgtactc 
gggctcgcgc 
ctcgtcgtga 
tctggattca 
gctacccgtg 
tacggtatcg 
ttctgagcgg 
gagatttcga 
acgccggctg 
ggctcgatcc 
tctaccggca 
atgcccccga 
aaggaacctt 
tctaaggtaa 
tgtgtatttt 
aatgaggaaa 
gactctcaac 
ccttcagaat 
tttgctattt 
tattctgtaa 
actccacaca 
agctttttaa 
gatcataatc 
cctccccctg 
agcttataat 
ttcactgcat 
ccccaggaag 
caatcatagg 
aggaaattgg 
gaagtccctt 
cagaaacata 
ctgtggttgc 
ttccaaaata 
aagcattatc 
tagcattttt 
cctgcagctc 
gttttccagc 
ttaccccaat 
ttaagtcctc 
cctattttta 



gcagcttata 
ttttcactgc 
atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 
agaaacggtg 
caagcgcaaa' 
cggttttatg 
ggaagccctg 
gatcaagatc 
tgcacgcagg 
agacaatcgg 
tttttgtcaa 
tatcgtggct 
cgggaaggga 
ttgctcctgc 
atccggctac 
ggatggaagc 
cagccgaact 
cccatggcga 
tcgactgtgg 
atattgctga 
ccgctcccga 
gactctgggg 
ttccaccgcc 
gatgatcctc 
cctcgcgagt 
gtgcaaatcc 
actgcaggag 
acttctgtgg 
atataaaatt 
agattccaac 
acctgttttg 
attctactcc 
tgctaagttt 
acaccacaaa 
cctttataag 
ggcatagagt 
tttgtaaagg 
agccatacca 
aacctgaaac 
ggttacaaat 
tctagttgtg 
ctcctctgtg 
ctgcccatcc 
gtaggggttt 
ccactgctgt 
caagctgtca 
tgtgttagta 
tctagtgttt 
cttatccaaa 
tggggttaca 
caaaggttcc 
accattttca 
aacctcagtt 
atttaaatta 
taggttaatg 



atggttacaa 
attctagttg 
ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
agggcgcaag 
ctgaccccgg 
gagaaagcag 
gacagcaagc 
caaagtaaac 
tgatcaagag 
ttctccggcc 
ctgctctgat 
gaccgacctg 
ggccacgacg 
ctggctgcta 
cgagaaagta 
ctgcccattc 
cggtcttgtc 
gttcgccagg 
tgcctgcttg 
ccggctgggt 
agagcttggc 
ttcgcagcgc 
ttcgaaatga 
gccttctatg 
cagcgcgggg 
tggttcagct 
gtcggcatcc 
tggggaggca 
tgtgacataa 
tttaagtgta 
ctatggaact 
ctcagaagaa 
tccaaaaaag 
tttgagtcat 
ggaaaaagct 
taggcataac 
gtctgctatt 
ggttaataag 
catttgtaga 
ataaaatgaa 
aaagcaatag 
gtttgtccaa 
tcctcataaa 
accctctgtg 
ttcacagacc 
gttccagaag 
gctttgcaca 
atgtgcaaaa 
tcatttttac 
acagccttgt 
gtttgagcag 
ccaccaacag 
tgagtttttt 
ttaacagtaa 
ggcaaaggaa 
tcatgataat 



ataaagcaat 
tggtttgtcc 
tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 
atgaatgtca 
gtagcttgca 
gaaccggaat 
tggatggctt 
acaggatgag 
gcttgggtgg 
gccgccgtgt 
tccggtgccc 
ggcgttcctt 
ttgggcgaag 
tccatcatgg 
gaccaccaag 
gatcaggatg 
ctcaaggcgc 
ccgaatatca 
gtggcggacc 
ggcgaatggg 
atcgccttct 
ccgaccaagc 
aaaggttggg 
atctcatgct 
gctgcctgag 
aggaaaccag 
cgatggccgc 
ttggacaaac 
taatgtgtta 
gatgaatggg 
atgccatcta 
aagagaaagg 
gctgtgttta 
gcactgctat 
agttataatc 
aataactatg 
gaatatttga 
ggttttactt 
tgcaattgtt 
catcacaaat 
actcatcaat 
ccctaacctc 
tcctcctgtt 
gctttctaag 
tgttggtaaa 
agggcccaac 
caggaggcac 
ttggatcagg 
ggtcagtgtt 
gatatttggt 
caaaaaaatg 
gtgtccctga 
cagcttccca 
ttcttgaaga 
aatggtttct 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
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tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 924 0 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 93 0 0 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 93 60 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 942 0 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 9540 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 960 0 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 972 0 
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 9900 
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10 02 0 
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 10080 
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 1014 0 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc 10320 
tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 103 80 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 10440 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 10500 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10560 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 10620 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10 74 0 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc 1092 0 
tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt 11100 
atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 1122 0 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 112 80 
tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 114 00 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 114 60 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc 11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 115 80 
catggattgg acgaaccact 116 00 

<210> 43 
<211> 35211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid AvlnBg 
<400> 43 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 3 00 
agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 42 0 
cgggtcaaag ttggcgtttt attattatag tcagtacgta ccagtgcact ggcctaggaa 4 80 
gcttggtacc ggtgaattcg ctagcgttcg cgccccgatg tacgggccag atatacgcgt 540 
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atctgagggg actagggtgt gtttaggcga 
agtcccctca ggatatagta gtttcgcttt 
caatactctt gtagtcttgc aacatggtaa 
agaaaaagca ccgtgcatgc cgattggtgg 
gaaggcaaca gacgggtctg acatggattg 
tattgtattt aagtgcctag ctcgatacaa 
tgtgcacctc cggccctggc cactctcttc 
ggctcgcggt tgaggacaaa ctcttcgcgg 
tcggcctccg aacggtactc cgccgccgag 
ggaaaacctc tcgagaaagg cgtgtaacca 
ccgggctgca ggaattcgat ctagatggat 
gaccccaagg actttccttc agaattgcta 
ttacaacgtc gtgactggga aaaccctggc 
ccccctttcg ccagctggcg taatagcgaa 
ttgcgcagcc tgaatggcga atggcgcttt 
gaaagctggc tggagtgcga tcttcctgag 
cagatgcacg gttacgatgc gcccatctac 
ccgccgtttg ttcccacgga gaatccgacg 
gaaagctggc tacaggaagg ccagacgcga 
catctgtggt gcaacgggcg ctgggtcggt 
tttgacctga gcgcattttt acgcgccgga 
tggagtgacg gcagttatct ggaagatcag 
gacgtctcgt tgctgcataa accgactaca 
tttaatgatg atttcagccg cgctgtactg 
cgtgactacc tacgggtaac agtttcttta 
accgcgcctt tcggcggtga aattatcgat 
ctacgtctga acgtcgaaaa cccgaaactg 
gcggtggttg aactgcacac cgccgacggc 
ggtttccgcg aggtgcggat tgaaaatggt 
attcgaggcg ttaaccgtca cgagcatcat 
acgatggtgc aggatatcct gctgatgaag 
cattatccga accatccgct gtggtacacg 
gatgaagcca atattgaaac ccacggcatg 
cgctggctac cggcgatgag cgaacgcgta 
ccgagtgtga tcatctggtc gctggggaat 
ctgtatcgct ggatcaaatc tgtcgatcct 
gccgacacca cggccaccga tattatttgc 
cccttcccgg ctgtgccgaa atggtccatc 
cgcccgctga tcctttgcga atacgcccac 
aaatactggc aggcgtttcg tcagtatccc 
gtggatcagt cgctgattaa atatgatgaa 
gattttggcg atacgccgaa cgatcgccag 
cgcacgccgc atccagcgct gacggaagca 
ttatccgggc aaaccatcga agtgaccagc 
ctcctgcact ggatggtggc gctggatggt 
gatgtcgctc cacaaggtaa acagttgatt 
gccgggcaac tctggctcac agtacgcgta 
gccgggcaca tcagcgcctg gcagcagtgg 
cccgccgcgt cccacgccat cccgcatctg 
ctgggtaata agcgttggca atttaaccgc 
ggcgataaaa aacaactgct gacgccgctg 
aacgacattg gcgtaagtga agcgacccgc 
aaggcggcgg gccattacca ggccgaagca 
gctgatgcgg tgctgattac gaccgctcac 
atcagccgga aaacctaccg gattgatggt 
gaagtggcga gcgatacacc gcatccggcg 
gtagcagagc gggtaaactg gctcggatta 
actgccgcct gttttgaccg ctgggatctg 
ttcccgagcg aaaacggtct gcgctgcggg 
tggcgcggcg acttccagtt caacatcagc 
agccatcgcc atctgctgca cgcggaagaa 



aaagcggggc ttcggttgta cgcggttagg 6 00 
tgcataggga gggggaaatg tagtcttatg 6 60 
cgatgagtta gcaacatgcc ttacaaggag 72 0 
aagtaaggtg gtacgatcgt gccttattag 7 80 
gacgaaccac tgaattccgc attgcagaga 84 0 
taaacgccat ttgaccattc accacattgg 90 0 
cgcatcgctg tctgcggggg ccagctgttg 960 
tctttccagt actcttggat cggaaacccg 102 0 
ggacctgagc gagtccgcat cgaccggatc 108 0 
gtcacagtcg ctctagaact agtggatccc 114 0 
aaaggtccaa aaaagaagag aaaggtagaa 12 0 0 
agttttttga gtgattcact ggccgtcgtt 1260 
gttacccaac ttaatcgcct tgcagcacat 1320 
gaggcccgca ccgatcgccc ttcccaacag 13 8 0 
gcctggtttc cggcaccaga agcggtgccg 144 0 
gccgatactg tcgtcgtccc ctcaaactgg 150 0 
accaacgtaa cctatcccat tacggtcaat 1560, 
ggttgttact cgctcacatt taatgttgat 162 0 
attatttttg atggcgttaa ctcggcgttt 1680 
tacggccagg acagtcgttt gccgtctgaa 174 0 
gaaaaccgcc tcgcggtgat ggtgctgcgt 180 0 
gatatgtggc ggatgagcgg cattttccgt 186 0 
caaatcagcg atttccatgt tgccactcgc 192 0 
gaggctgaag ttcagatgtg cggcgagttg 198 0 . 
tggcagggtg aaacgcaggt cgccagcggc 2 04 0 
gagcgtggtg gttatgccga tcgcgtcaca 210 0 
tggagcgccg aaatcccgaa tctctatcgt 2160 
acgctgattg aagcagaagc ctgcgatgtc 2 22 0 
ctgctgctgc tgaacggcaa gccgttgctg 2 28 0 
cctctgcatg gtcaggtcat ggatgagcag 2 34 0 
cagaacaact ttaacgccgt gcgctgttcg 240 0 
ctgtgcgacc gctacggcct gtatgtggtg 24 6 0 
gtgccaatga atcgtctgac cgatgatccg 2 52 0 
acgcgaatgg tgcagcgcga tcgtaatcac 2 58 0 
gaatcaggcc acggcgctaa tcacgacgcg 2 64 0 
tcccgcccgg tgcagtatga aggcggcgga 2 70 0 
ccgatgtacg cgcgcgtgga tgaagaccag 2 760 
aaaaaatggc tttcgctacc tggagagacg 2 82 0 
gcgatgggta acagtcttgg cggtttcgct 2 88 0 
cgtttacagg gcggcttcgt ctgggactgg 2 94 0 
aacggcaacc cgtggtcggc ttacggcggt 3 00 0 
ttctgtatga acggtctggt ctttgccgac 3 060 
aaacaccagc agcagttttt ccagttccgt 3120 
gaatacctgt tccgtcatag cgataacgag 318 0 
aagccgctgg caagcggtga agtgcctctg 3 24 0 
gaactgcctg aactaccgca gccggagagc 33 0 0 
gtgcaaccga acgcgaccgc atggtcagaa 33 60 
cgtctggcgg aaaacctcag tgtgacgctc 342 0 
accaccagcg aaatggattt ttgcatcgag 3480 
cagtcaggct ttctttcaca gatgtggatt 3 54 0 
cgcgatcagt tcacccgtgc accgctggat 3 60 0 
attgacccta acgcctgggt cgaacgctgg 3 66 0 
gcgttgttgc agtgcacggc agatacactt 3 72 0 
gcgtggcagc atcaggggaa aaccttattt 3 78 0 
agtggtcaaa tggcgattac cgttgatgtt 3 84 0 
cggattggcc tgaactgcca gctggcgcag 3 90 0 
gggccgcaag aaaactatcc cgaccgcctt 3 96 0 
ccattgtcag acatgtatac cccgtacgtc 4 02 0 
acgcgcgaat tgaattatgg cccacaccag 4 08 0 
cgctacagtc aacagcaact gatggaaacc 414 0 
ggcacatggc tgaatatcga cggtttccat 42 0 0 
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atggggattg gtggcgacga ctcctggagc 
gccggtcgct accattacca gttggtctgg 
atcgataccg tcgaaacttg tttattgcag 
tcacaaattt cacaaataaa gcattttttt 
tcatcaatgt atcttatcat gtctggatcc 
atgagacccg caccaggtgc agaccctgcg 
ctgtgatgct ggatgtgacc gaggagctga 
gcgctgagtt tggctctagc gatgaagata 
gcttaagggt gggaaagaat atataaggtg 
cagcagccgc cgccgccatg agcaccaact 
tgacaacgcg catgccccca tgggccgggg 
atggtcgccc cgtcctgccc gcaaactcta 
cgccgttgga gactgcagcc tccgccgccg 
ttgtgactga ctttgctttc ctgagcccgc 
cccgcgatga caagttgacg gctcttttgg 
atgtcgtttc tcagcagctg ttggatctgc 
cccctcccaa tgcggtttaa aacataaata 
agcaagtgtc ttgctgtctt tatttagggg 
ggtctcggtc gttgagggtc ctgtgtattt 
tgttcagata catgggcata agcccgtctc 
catgctgcgg ggtggtgttg tagatgatcc 
taaaaatgtc tttcagtagc aagctgattg 
caaagcggtt aagctgggat gggtgcatac 
tttttaggtt ggctatgttc ccagccatat 
ccagcacagt gtatccggtg cacttgggaa 
ggaagaactt ggagacgccc ttgtgacctc 
tggcaatggg cccacgggcg gcggcctggg 
agttgtgttc aggatgagat cgtcataggc 
agactgcggt ataatggttc catccggccc 
ttcccacgct ttgagttcag atggggggat 
ggtttccggg gtaggggaga tcagctggga 
accgcagccg gtgggcccgt aaatcacacc 
gctgcagctg ccgtcatccc tgagcagggg 
catgttttcc ctgaccaaat ccgccagaag 
caaggaagca aagtttttca acggtttgag 
ttgaccaagc agttccaggc ggtcccacag 
cagcatatct cctcgtttcg cgggttgggg 
tcgtccagac gggccagggt catgtctttc 
tgggtcacgg tgaaggggtg cgctccgggc 
gtcctgctgg tgctgaagcg ctgccggtct 
accatggtgt catagtccag cccctccgcg 
gaggaggcgc cgcacgaggg gcagtgcaga 
aataccgatt ccggggagta ggcatccgcg 
acgagccagg tgagctctgg ccgttcgggg 
atgcgtttct tacctctggt ttccatgagc 
tccgtgtccc cgtatacaga cttgagaggc 
tcgtatagaa actcggacca ctctgagaca 
gctaagtggg aggggtagcg gtcgttgtcc 
agacacatgt cgccctcttc ggcatcaagg 
tgaccgggtg ttcctgaagg ggggctataa 
tcttccgcat cgctgtctgc gagggccagc 
ggcatgactt ctgcgctaag attgtcagtt 
tggcccgcgg tgatgccttt gagggtggcc 
ttgttgtcaa gcttggtggc aaacgacccg 
gagcgcaggg tttggttttt gtcgcgatcg 
acgtattcgc gcgcaacgca ccgccattcg 
aggtgcacgc gccaaccgcg gttgtgcagg 
ccgcgtaggc gctcgttggt ccagcagagg 
agggggtcta gctgcgtctc gtccgggggg 
aggcgcgcgt cgaagtagtc tatcttgcat 
cgggcggcaa gcgcgcgctc gtatgggttg 



ccgtcagtat cggcggaatt tcagctgagc 42 60 
tgtcaaaaat aataatctcg aatcaagctt 432 0 
cttataatgg ttacaaataa agcaatagca 4380 
cactgcattc tagttgtggt ttgtccaaac 444 0 
gacctcggat ctggaaggtg ctgaggtacg 450 0 
agtgtggcgg taaacatatt aggaaccagc 456 0 
ggcccgatca cttggtgctg gcctgcaccc 4 62 0 
cagattgagg tactgaaatg tgtgggcgtg 4 68 0 
ggggtcttat gtagttttgt atctgttttg 4 74 0 
cgtttgatgg aagcattgtg agctcatatt 4 80 0 
tgcgtcagaa tgtgatgggc tccagcattg 4 860 
ctaccttgac ctacgagacc gtgtctggaa 492 0 
cttcagccgc tgcagccacc gcccgcggga 4980 
ttgcaagcag tgcagcttcc cgttcatccg 5 04 0 
cacaattgga ttctttgacc cgggaactta 510 0 
gccagcaggt ttctgccctg aaggcttcct 5160 
aaaaaccaga ctctgtttgg atttggatca 522 0 
ttttgcgcgc gcggtaggcc cgggaccagc 528 0 
tttccaggac gtggtaaagg tgactctgga 534 0 
tggggtggag gtagcaccac tgcagagctt 540 0 
agtcgtagca ggagcgctgg gcgtggtgcc 546 0 
ccaggggcag gcccttggtg taagtgttta 552 0 
gtggggatat gagatgcatc ttggactgta 558 0 
ccctccgggg attcatgttg tgcagaacca 564 0 
atttgtcatg tagcttagaa ggaaatgcgt 5700 
caagattttc catgcattcg tccataatga 5760 
cgaagatatt tctgggatca ctaacgtcat 5 82 0 
catttttaca aagcgcgggc ggagggtgcc 5 880 
aggggcgtag ttaccctcac agatttgcat 594 0 
catgtctacc tgcggggcga tgaagaaaac 6 00 0 
agaaagcagg ttcctgagca gctgcgactt 6 060 
tattaccggg tgcaactggt agttaagaga 612 0 
ggccacttcg ttaagcatgt ccctgactcg 6180 
gcgctcgccg cccagcgata gcagttcttg 624 0 
accgtccgcc gtaggcatgc ttttgagcgt 63 0 0 
ctcggtcacc tgctctacgg catctcgatc 6360 
cggctttcgc tgtacggcag tagtcggtgc 642 0 
cacgggcgca gggtcctcgt cagcgtagtc 6480 
tgcgcgctgg ccagggtgcg cttgaggctg 654 0 
tcgccctgcg cgtcggccag gtagcatttg 6600 
gcgtggccct tggcgcgcag cttgcccttg 6660 
cttttgaggg cgtagagctt gggcgcgaga 672 0 
ccgcaggccc cgcagacggt ctcgcattcc 67 8 0 
tcaaaaacca ggtttccccc atgctttttg 6840 
cggtgtccac gctcggtgac gaaaaggctg 690 0 
ctgtcctcga gcggtgttcc gcggtcctcc 6960 
aaggctcgcg tccaggccag cacgaaggag 7 02 0 
actagggggt ccactcgctc cagggtgtga 7 080 
aaggtgattg gtttgtaggt gtaggccacg 714 0 
aagggggtgg gggcgcgttc gtcctcactc 72 0 0 
tgttggggtg agtactccct ctgaaaagcg 7260 
tccaaaaacg aggaggattt gatattcacc 732 0 
gcatccatct ggtcagaaaa gacaatcttt 73 80 
tagagggcgt tggacagcaa cttggcgatg 744 0 
gcgcgctcct tggccgcgat gtttagctgc 7500 
ggaaagacgg tggtgcgctc gtcgggcacc 756 0 
gtgacaaggt caacgctggt ggctacctct 762 0 
cggccgccct tgcgcgagca gaatggcggt 768 0 
tctgcgtcca cggtaaagac cccgggcagc 7740 
ccttgcaagt ctagcgcctg ctgccatgcg 7 80 0 
agtgggggac cccatggcat ggggtgggtg 786 0 
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agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct gagtattcca 792 0 
agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc gtatagttcg 7980 
tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc tgctcggaag 8040 
actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg gaagacgttg 810 0 
aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga gtcgcgcagc 8160 
ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag ggtttccttg 8220 
atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag gacaaactct 82 80 
tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg gtaagagcct 834 0 
agcatgtaga actggttgac ggcctggtag gcgcagcatc ccttttctac gggtagcgcg 8400 
tatgcctgcg cggccttccg gagcgaggtg tgggtgagcg caaaggtgtc cctgaccatg 8460 
actttgaggt actggtattt gaagtcagtg tcgtcgcatc cgccctgctc ccagagcaaa 852 0 
aagtccgtgc gctttttgga acgcggattt ggcagggcga aggtgacatc gttgaagagt 8580 
atctttcccg cgcgaggcat aaagttgcgt gtgatgcgga agggtcccgg cacctcggaa 8640 
cggttgttaa ttacctgggc ggcgagcacg atctcgtcaa agccgttgat gttgtggccc 8700 
acaatgtaaa gttccaagaa gcgcgggatg cccttgatgg aaggcaattt tttaagttcc 8760 
tcgtaggtga gctcttcagg ggagctgagc ccgtgctctg aaagggccca gtctgcaaga 882 0 
tgagggttgg aagcgacgaa tgagctccac aggtcacggg ccattagcat ttgcaggtgg 88 80 
tcgcgaaagg tcctaaactg gcgacctatg gccatttttt ctggggtgat gcagtagaag 8940 
gtaagcgggt cttgttccca gcggtcccat ccaaggttcg cggctaggtc tcgcgcggca 90 00 
gtcactagag gctcatctcc gccgaacttc atgaccagca tgaagggcac gagctgcttc 9060 
ccaaaggccc ccatccaagt ataggtctct acatcgtagg tgacaaagag acgctcggtg 9120 
cgaggatgcg agccgatcgg gaagaactgg atctcccgcc accaattgga ggagtggcta 9180 
ttgatgtggt gaaagtagaa gtccctgcga cgggccgaac actcgtgctg gcttttgtaa 9240 
aaacgtgcgc agtactggca gcggtgcacg ggctgtacat cctgcacgag gttgacctga 93 00 
cgaccgcgca caaggaagca gagtgggaat ttgagcccct cgcctggcgg gtttggctgg 93 60 
tggtcttcta cttcggctgc ttgtccttga ccgtctggct gctcgagggg agttacggtg 942 0 
gatcggacca ccacgccgcg cgagcccaaa gtccagatgt ccgcgcgcgg cggtcggagc 9480 
ttgatgacaa catcgcgcag atgggagctg tccatggtct ggagctcccg cggcgtcagg 954 0 
tcaggcggga gctcctgcag gtttacctcg catagacggg tcagggcgcg ggctagatcc 96 00 
aggtgatacc taatttccag gggctggttg gtggcggcgt cgatggcttg caagaggccg 9660 
catccccgcg gcgcgactac ggtaccgcgc ggcgggcggt gggccgcggg ggtgtccttg 9720 
gatgatgcat ctaaaagcgg tgacgcgggc gagcccccgg aggtaggggg ggctccggac 97 80 
ccgccgggag agggggcagg ggcacgtcgg cgccgcgcgc gggcaggagc tggtgctgcg 9840 
cgcgtaggtt gctggcgaac gcgacgacgc ggcggttgat ctcctgaatc tggcgcctct 9900 
gcgtgaagac gacgggcccg gtgagcttga gcctgaaaga gagttcgaca gaatcaattt 9960 
cggtgtcgtt gacggcggcc tggcgcaaaa tctcctgcac gtctcctgag ttgtcttgat 10020 
aggcgatctc ggccatgaac tgctcgatct cttcctcctg gagatctccg cgtccggctc 10080 
gctccacggt ggcggcgagg tcgttggaaa tgcgggccat gagctgcgag aaggcgttga 10140 
ggcctccctc gttccagacg cggctgtaga ccacgccccc ttcggcatcg cgggcgcgca 102 0 0 
tgaccacctg cgcgagattg agctccacgt gccgggcgaa gacggcgtag tttcgcaggc 10260 
gctgaaagag gtagttgagg gtggtggcgg tgtgttctgc cacgaagaag tacataaccc 10320 
agcgtcgcaa cgtggattcg ttgatatccc ccaaggcctc aaggcgctcc atggcctcgt 103 80 
agaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgacacggtt aactcctcct 1044 0 
ccagaagacg gatgagctcg gcgacagtgt cgcgcacctc gcgctcaaag gctacagggg 10500 
cctcttcttc ttcttcaatc tcctcttcca taagggcctc cccttcttct tcttctggcg 10560 
gcggtggggg aggggggaca cggcggcgac gacggcgcac cgggaggcgg tcgacaaagc 1062 0 
gctcgatcat ctccccgcgg cgacggcgca tggtctcggt gacggcgcgg ccgttctcgc 10680 
gggggcgcag ttggaagacg ccgcccgtca tgtcccggtt atgggttggc ggggggctgc 10740 
catgcggcag ggatacggcg ctaacgatgc atctcaacaa ttgttgtgta ggtactccgc 108 00 
cgccgaggga cctgagcgag tccgcatcga ccggatcgga aaacctctcg agaaaggcgt 10860 
ctaaccagtc acagtcgcaa ggtaggctga gcaccgtggc gggcggcagc gggcggcggt 10920 
cggggttgtt tctggcggag gtgctgctga tgatgtaatt aaagtaggcg gtcttgagac 10980 
ggcggatggt cgacagaagc accatgtcct tgggtccggc ctgctgaatg cgcaggcggt 1104 0 
cggccatgcc ccaggcttcg ttttgacatc ggcgcaggtc tttgtagtag tcttgcatga 11100 
gcctttctac cggcacttct tcttctcctt cctcttgtcc tgcatctctt gcatctatcg 11160 
ctgcggcggc ggcggagttt ggccgtaggt ggcgccctct tcctcccatg cgtgtgaccc 1122 0 
cgaagcccct catcggctga agcagggcta ggtcggcgac aacgcgctcg gctaatatgg 112 8 0 
cctgctgcac ctgcgtgagg gtagactgga agtcatccat gtccacaaag cggtggtatg 11340 
cgcccgtgtt gatggtgtaa gtgcagttgg ccataacgga ccagttaacg gtctggtgac 114 00 
ccggctgcga gagctcggtg tacctgagac gcgagtaagc cctcgagtca aatacgtagt 11460 
cgttgcaagt ccgcaccagg tactggtatc ccaccaaaaa gtgcggcggc ggctggcggt 1152 0 



WO 2003/062400 



PCT/US2003/002295 



-18- 



agaggggcca gcgtagggtg gccggggctc 
gatatccgta gatgtacctg gacatccagg 
gaaagtcgcg gacgcggttc cagatgttgc 
cgctctggcc ggtcaggcgc gcgcaatcgt 
tgtaagcggg cactcttccg tggtctggtg 
accggggttc gagccccgta tccggccgtc 
gtcgaaccca ggtgtgcgac gtcagacaac 
gcgcggcggc tgctgcgcta gcttttttgg 
gctggaaagc gaaagcatta agtggctcgc 
gttgagtcgc gggacccccg gttcgagtct 
ttgcctcccc gtcatgcaag accccgcttg 
ttttttgctt ttcccagatg catccggtgc 
ggcaagagca agagcagcgg cagacatgca 
gaggggcgac atccgcggtt gacgcggcag 
gggcccggca ctacctggac ttggaggagg 
ctcctgagcg gtacccaagg gtgcagctga 
ggcagaacct gtttcgcgac cgcgagggag 
tccacgcagg gcgcgagctg cggcatggcc 
actttgagcc cgacgcgcga accgggatta 
acctggtaac cgcatacgag cagacggtga 
acaaccacgt gcgtacgctt gtggcgcgcg 
gggactttgt aagcgcgctg gagcaaaacc 
tccttatagt gcagcacagc agggacaacg 
tagagcccga gggccgctgg ctgctcgatt 
aggagcgcag cttgagcctg gctgacaagg 
tgggcaagtt ttacgcccgc aagatatacc 
taaagatcga ggggttctac atgcgcatgg 
tgggcgttta tcgcaacgag cgcatccaca 
tcagcgaccg cgagctgatg cacagcctgc 
atagagaggc cgagtcctac tttgacgcgg 
gcgccctgga ggcagctggg gccggacctg 
acgtcggcgg cgtggaggaa tatgacgagg 
actaagcggt gatgtttctg atcagatgat 
ggcgctgcag agccagccgt ccggccttaa 
ccgcatcatg tcgctgactg cgcgcaatcc 
ccggctctcc gcaattctgg aagcggtggt 
ggtgctggcg atcgtaaacg cgctggccga 
cctggtctac gacgcgctgc ttcagcgcgt 
caacctggac cggctggtgg gggatgtgcg 
gcagcagggc aacctgggct ccatggttgc 
caacgtgccg cggggacagg aggactacac 
gactgagaca ccgcaaagtg aggtgtacca 
tagacaaggc ctgcagaccg taaacctgag 
9ggggt9* c 9g gctcccacag gcgaccgcgc 
gcgcctgttg ctgctgctaa tagcgccctt 
atacctaggt cacttgctga cactgtaccg 
gcatactttc caggagatta caagtgtcag 
cctggaggca accctaaact acctgctgac 
cagtttaaac agcgaggagg agcgcatttt 
cctgatgcgc gacggggtaa cgcccagcgt 
accgggcatg tatgcctcaa accggccgtt 
cgcggccgcc gtgaaccccg agtatttcac 
gccccctggt ttctacaccg ggggattcga 
ggacgacata gacgacagcg tgttttcccc 
gcgcgagcag gcagaggcgg cgctgcgaaa 
cgatctaggc gctgcggccc cgcggtcaga 
gtctcttacc agcactcgca ccacccgccc 
caactcgctg ctgcagccgc agcgcgaaaa 
gatagagagc ctagtggaca agatgagtag 
cgtgccaggc ccgcgcccgc ccacccgtcg 
gtgggaggac gatgactcgg cagacgacag 



cgggggcgag atcttccaac ataaggcgat 115 80 
tgatgccggc ggcggtggtg gaggcgcgcg 1164 0 
gcagcggcaa aaagtgctcc atggtcggga 117 0 0 
tgacgctcta gaccgtgcaa aaggagagcc 117 60 
gataaattcg caagggtatc atggcggacg 1182 0 
cgccgtgatc catgcggtta ccgcccgcgt 118 80 
gggggagtgc tccttttggc ttccttccag 11940 
ccactggccg cgcgcagcgt aagcggttag 12000 
tccctgtagc cggagggtta ttttccaagg 12 0 60 
cggaccggcc ggactgcggc gaacgggggt 1212 0 
caaattcctc cggaaacagg gacgagcccc 12180 
tgcggcagat gcgcccccct cctcagcagc 12240 
gggcaccctc ccctcctcct accgcgtcag 123 0 0 
cagatggtga ttacgaaccc ccgcggcgcc 12360 
gcgagggcct ggcgcggcta ggagcgccct 1242 0 
agcgtgatac gcgtgaggcg tacgtgccgc 12480 
aggagcccga ggagatgcgg gatcgaaagt 1254 0 
tgaatcgcga gcggttgctg cgcgaggagg 126 0 0 
gtcccgcgcg cgcacacgtg gcggccgccg 12 660 
accaggagat taactttcaa aaaagcttta 12 72 0 
aggaggtggc tataggactg atgcatctgt 12780 
caaatagcaa gccgctcatg gcgcagctgt 12840 
aggcattcag ggatgcgctg ctaaacatag 12900 
tgataaacat cctgcagagc atagtggtgc 12960 
tggccgccat caactattcc atgcttagcc 13 020 
atacccctta cgttcccata gacaaggagg 13 08 0 
cgctgaaggt gcttaccttg agcgacgacc 1314 0 
aggccgtgag cgtgagccgg cggcgcgagc 132 00 
aaagggccct ggctggcacg ggcagcggcg 13260 
gcgctgacct gcgctgggcc ccaagccgac 1332 0 
ggctggcggt ggcacccgcg cgcgctggca 133 80 
acgatgagta cgagccagag gacggcgagt 1344 0 
gcaagacgca acggacccgg cggtgcgggc 1350 0 
ctccacggac gactggcgcc aggtcatgga 13560 
tgacgcgttc cggcagcagc cgcaggccaa 13 62 0 
cccggcgcgc gcaaacccca cgcacgagaa 13 680 
aaacagggcc atccggcccg acgaggccgg 13 74 0 
ggctcgttac aacagcggca acgtgcagac 13 8 00 
cgaggccgtg gcgcagcgtg agcgcgcgca 13 860 
actaaacgcc ttcctgagta cacagcccgc 1392 0 
caactttgtg agcgcactgc ggctaatggt 13 980 
gtctgggcca gactattttt tccagaccag 14 040 
ccaggctttc aaaaacttgc aggggctgtg 1410 0 
gaccgtgtct agcttgctga cgcccaactc 14160 
cacggacagt ggcagcgtgt cccgggacac 1422 0 
cgaggccata ggtcaggcgc atgtggacga 142 8 0 
ccgcgcgctg gggcaggagg acacgggcag 14340 
caaccggcgg cagaagatcc cctcgttgca 144 0 0 
gcgctacgtg cagcagagcg tgagccttaa 144 60 
ggcgctggac atgaccgcgc gcaacatgga 1452 0 
tatcaaccgc ctaatggact acttgcatcg 14580 
caatgccatc ttgaacccgc actggctacc 14640 
ggtgcccgag ggtaacgatg gattcctctg 1470 0 
gcaaccgcag accctgctag agttgcaaca 14760 
ggaaagcttc cgcaggccaa gcagcttgtc 14 82 0 
tgctagtagc ccatttccaa gcttgatagg 14880 
gcgcctgctg ggcgaggagg agtacctaaa 14 94 0 
aaacctgcct ccggcatttc ccaacaacgg 15 00 0 
atggaagacg tacgcgcagg agcacaggga 15060 
tcaaaggcac gaccgtcagc ggggtctggt 1512 0 
cagcgtcctg gatttgggag ggagtggcaa 15180 



WO 2003/062400 



PCT/US2003/002295 



-19- 



cccgtttgcg caccttcgcc ccaggctggg 
gcaaaataaa aaactcacca aggccatggc 
agtatgcggc gcgcggcgat gtatgaggaa 
agcgcggcgc cagtggcggc ggcgctgggt 
tttgtgcctc cgcggtacct gcggcctacc 
ttggcacccc tattcgacac cacccgtgtg 
gcatccctga actaccagaa cgaccacagc 
gactacagcc cgggggaggc aagcacacag 
ggcggcgacc tgaaaaccat cctgcatacc 
accaataagt ttaaggcgcg ggtgatggtg 
gagctgaaat acgagtgggt ggagttcacg 
accatagacc ttatgaacaa cgcgatcgtg 
ggggttctgg aaagcgacat cggggtaaag 
gaccccgtca ctggtcttgt catgcctggg 
atcattttgc tgccaggatg cggggtggac 
ggcatccgca agcggcaacc cttccaggag 
ggtggtaaca ttcccgcact gttggatgtg 
accgaacagg gcgggggtgg cgcaggcggc 
aactccaacg cggcagccgc ggcaatgcag 
cgcggcgaca cctttgccac acgggctgag 
gaagctgccg cccccgctgc gcaacccgag 
aaacccctga cagaggacag caagaaacgc 
ttcacccagt accgcagctg gtaccttgca 
cgctcatgga ccctgctttg cactcctgac 
tcgttgccag acatgatgca agaccccgtg 
tttccggtgg tgggcgccga gctgttgccc 
gccgtctact cccaactcat ccgccagttt 
cccgagaacc agattttggc' gcgcccgcca 
gttcctgctc tcacagatca cgggacgcta 
cgagtgacca ttactgacgc cagacgccgc 
atagtctcgc cgcgcgtcct atcgagccgc 
tcgcccagca ataacacagg ctggggcctg 
aagaagcgct ccgaccaaca cccagtgcgc 
gcgcacaaac gcggccgcac tgggcgcacc 
gaggaggcgc gcaactacac gcccacgccg 
cagaccgtgg tgcgcggagc ccggcgctat 
gcacgtcgcc accgccgccg acccggcact 
aaccgcgcac gtcgcaccgg ccgacgggcg 
ggtattgtca ctgtgccccc caggtccagg 
attagtgcta tgactcaggg tcgcaggggc 
ggcctgcgcg tgcccgtgcg cacccgcccc 
ttagactcgt actgttgtat gtatccagcg 
cgcaaaatca aagaagagat gctccaggtc 
aaggaagagc aggattacaa gccccgaaag 
gatgatgatg aacttgacga cgaggtggaa 
gtacagtgga aaggtcgacg cgtaaaacgt 
acgcccggtg agcgctccac ccgcacctac 
gaggacctgc ttgagcaggc caacgagcgc 
aaggacatgc tggcgttgcc gctggacgag 
acactgcagc aggtgctgcc cgcgcttgca 
gagtctggtg acttggcacc caccgtgcag 
gatgtcttgg aaaaaatgac cgtggaacct 
atcaagcagg tggcgccggg actgggcgtg 
agtagcacca gtattgccac cgccacagag 
tcagcggtgg cggatgccgc ggtgcaggcg 
gaggtgcaaa cggacccgtg gatgtttcgc 
aggaagtacg gcgccgccag cgcgctactg 
cctacccccg gctatcgtgg ctacacctac 
cgaaccacca ctggaacccg ccgccgccgt 
tccgtgcgca gggtggctcg cgaaggaggc 
caccccagca tcgtttaaaa gccggtcttt 



gagaatgttt taaaaaaaaa aaagcatgat 1524 0 
accgagcgtt ggttttcttg tattcccctt 153 0 0 
ggtcctcctc cctcctacga gagtgtggtg 153 60 
tctcccttcg atgctcccct ggacccgccg 1542 0 
ggggggagaa acagcatccg ttactctgag 15480 
tacctggtgg acaacaagtc aacggatgtg 1554 0 
aactttctga ccacggtcat tcaaaacaat 1560 0 
accatcaatc ttgacgaccg gtcgcactgg 1566 0 
aacatgccaa atgtgaacga gttcatgttt 1572 0 
tcgcgcttgc ctactaagga caatcaggtg 157 80 
ctgcccgagg gcaactactc cgagaccatg 15 84 0 
gagcactact tgaaagtggg cagacagaac 15900 
tttgacaccc gcaacttcag actggggttt 15960 
gtatatacaa acgaagcctt ccatccagac 16020 
ttcacccaca gccgcctgag caacttgttg 16080 
ggctttagga tcacctacga tgatctggag 1614 0 
gacgcctacc aggcgagctt gaaagatgac 162 0 0 
agcaacagca gtggcagcgg cgcggaagag 162 60 
ccggtggagg acatgaacga tcatgccatt 1632 0 
gagaagcgcg ctgaggccga agcagcggcc 16380 
gtcgagaagc ctcagaagaa accggtgatc 1644 0 
agttacaacc taataagcaa tgacagcacc 16500 
tacaactacg gcgaccctca gaccggaatc 16560 
gtaacctgcg gctcggagca ggtctactgg 1662 0 
accttccgct ccacgcgcca gatcagcaac 16680 
gtgcactcca agagcttcta caacgaccag 16740 
acctctctga cccacgtgtt caatcgcttt 16800 
gcccccacca tcaccaccgt cagtgaaaac 16860 
ccgctgcgca acagcatcgg aggagtccag 1692 0 
acctgcccct acgtttacaa ggccctgggc 16980 
actttttgag caagcatgtc catccttata 17040 
cgcttcccaa gcaagatgtt tggcggggcc 17100 
gtgcgcgggc actaccgcgc gccctggggc 17160 
accgtcgatg acgccatcga cgcggtggtg 17220 
ccaccagtgt ccacagtgga cgcggccatt i72 80 
gctaaaatga agagacggcg gaggcgcgta 17340 
gccgcccaac gcgcggcggc ggccctgctt 174 0 0 
gccatgcggg ccgctcgaag gctggccgcg 17460 
cgacgagcgg ccgccgcagc agccgcggcc 1752 0 
aacgtgtatt gggtgcgcga ctcggttagc 17580 
ccgcgcaact agattgcaag aaaaaactac 17640 
gcggcggcgc gcaacgaagc tatgtccaag 17700 
atcgcgccgg agatctatgg ccccccgaag 17760 
ctaaagcggg tcaaaaagaa aaagaaagat 1782 0 
ctgctgcacg ctaccgcgcc caggcgacgg 17880 
gttttgcgac ccggcaccac cgtagtcttt 1794 0 
aagcgcgtgt atgatgaggt gtacggcgac 180 0 0 
ctcggggagt ttgcctacgg aaagcggcat 180 60 
ggcaacccaa cacctagcct aaagcccgta 1812 0 
ccgtccgaag aaaagcgcgg cctaaagcgc 1818 0 
ctgatggtac ccaagcgcca gcgactggaa 18240 
gggctggagc ccgaggtccg cgtgcggcca 18300 
cagaccgtgg acgttcagat acccactacc 183 60 
ggcatggaga cacaaacgtc cccggttgcc 1842 0 
gtcgctgcgg ccgcgtccaa gacctctacg 18480 
gtttcagccc cccggcgccc gcgcggttcg 18540 
cccgaatatg ccctacatcc ttccattgcg 18600 
cgccccagaa gacgagcaac tacccgacgc 18660 
cgccgtcgcc agcccgtgct ggccccgatt 18720 
aggaccctgg tgctgccaac agcgcgctac 18780 
gtggttcttg cagatatggc cctcacctgc 18840 
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cgcctccgtt tcccggtgcc gggattccga ggaagaatgc accgtaggag gggcatggcc 18900 
ggccacggcc tgacgggcgg catgcgtcgt gcgcaccacc ggcggcggcg cgcgtcgcac 18960 
cgtcgcatgc gcggcggtat cctgcccctc cttattccac tgatcgccgc ggcgattggc 1902 0 
gccgtgcccg gaattgcatc cgtggccttg caggcgcaga gacactgatt aaaaacaagt 19080 
tgcatgtgga aaaatcaaaa taaaaagtct ggactctcac gctcgcttgg tcctgtaact 1914 0 
attttgtaga atggaagaca tcaactttgc gtctctggcc ccgcgacacg gctcgcgccc 19200 
gttcatggga aactggcaag atatcggcac cagcaatatg agcggtggcg ccttcagctg 19260 
gggctcgctg tggagcggca ttaaaaattt cggttccacc gttaagaact atggcagcaa 1932 0 
ggcctggaac agcagcacag gccagatgct gagggataag ttgaaagagc aaaatttcca 19380 
acaaaaggtg gtagatggcc tggcctctgg cattagcggg gtggtggacc tggccaacca 1944 0 
ggcagtgcaa aataagatta acagtaagct tgatccccgc cctcccgtag aggagcctcc 19500 
accggccgtg gagacagtgt ctccagaggg gcgtggcgaa aagcgtccgc gccccgacag 195 60 
ggaagaaact ctggtgacgc aaatagacga gcctccctcg tacgaggagg cactaaagca 19620 
aggcctgccc accacccgtc ccatcgcgcc catggctacc ggagtgctgg gccagcacac 19680 
acccgtaacg ctggacctgc ctccccccgc cgacacccag cagaaacctg tgctgccagg 19740 
cccgaccgcc gttgttgtaa cccgtcctag ccgcgcgtcc ctgcgccgcg ccgccagcgg 19800 
tccgcgatcg ttgcggcccg tagccagtgg caactggcaa agcacactga acagcatcgt 19860 
gggtctgggg gtgcaatccc tgaagcgccg acgatgcttc tgaatagcta acgtgtcgta 19920 
tgtgtgtcat gtatgcgtcc atgtcgccgc cagaggagct gctgagccgc cgcgcgcccg 19980 
ctttccaaga tggctacccc ttcgatgatg ccgcagtggt cttacatgca catctcgggc 20040 
caggacgcct cggagtacct gagccccggg ctggtgcagt ttgcccgcgc caccgagacg 20100 
tacttcagcc tgaataacaa gtttagaaac cccacggtgg cgcctacgca cgacgtgacc 2 0160 
acagaccggt cccagcgttt gacgctgcgg ttcatccctg tggaccgtga ggatactgcg 2 0220 
tactcgtaca aggcgcggtt caccctagct gtgggtgata accgtgtgct ggacatggct 2 0280 
tccacgtact ttgacatccg cggcgtgctg gacaggggcc ctacttttaa gccctactct 2 0340 
ggcactgcct acaacgccct ggctcccaag ggtgccccaa atccttgcga atgggatgaa 2 040 0 
gctgctactg ctcttgaaat aaacctagaa gaagaggacg atgacaacga agacgaagta 2 0460 
gacgagcaag ctgagcagca aaaaactcac gtatttgggc aggcgcctta ttctggtata 2 052 0 
aatattacaa aggagggtat tcaaataggt gtcgaaggtc aaacacctaa atatgccgat 20580 
aaaacatttc aacctgaacc tcaaatagga gaatctcagt ggtacgaaac tgaaattaat 2 064 0 
catgcagctg ggagagtcct taaaaagact accccaatga aaccatgtta cggttcatat 2 07 00 
gcaaaaccca caaatgaaaa tggagggcaa ggcattcttg taaagcaaca aaatggaaag 20760 
ctagaaagtc aagtggaaat gcaatttttc tcaactactg aggcgaccgc aggcaatggt 20820 
gataacttga ctcctaaagt ggtattgtac agtgaagatg tagatataga aaccccagac 2 08 80 
actcatattt cttacatgcc cactattaag gaaggtaact cacgagaact aatgggccaa 2 0940 
caatctatgc ccaacaggcc taattacatt gcttttaggg acaattttat tggtctaatg 210 0 0 
tattacaaca gcacgggtaa tatgggtgtt ctggcgggcc aagcatcgca gttgaatgct 21060 
gttgtagatt tgcaagacag aaacacagag ctttcatacc agcttttgct tgattccatt 2112 0 
ggtgatagaa ccaggtactt ttctatgtgg aatcaggctg ttgacagcta tgatccagat 21180 
gttagaatta ttgaaaatca tggaactgaa gatgaacttc caaattactg ctttccactg 21240 
ggaggtgtga ttaatacaga gactcttacc aaggtaaaac ctaaaacagg tcaggaaaat 213 00 
ggatgggaaa aagatgctac agaattttca gataaaaatg aaataagagt tggaaataat 213 60 
tttgccatgg aaatcaatct aaatgccaac ctgtggagaa atttcctgta ctccaacata 21420 
gcgctgtatt tgcccgacaa gctaaagtac agtccttcca acgtaaaaat ttctgataac 21480 
ccaaacacct acgactacat gaacaagcga gtggtggctc ccgggttagt ggactgctac 2154 0 
attaaccttg gagcacgctg gtcccttgac tatatggaca acgtcaaccc atttaaccac 2160 0 
caccgcaatg ctggcctgcg ctaccgctca atgttgctgg gcaatggtcg ctatgtgccc 21660 
ttccacatcc aggtgcctca gaagttcttt gccattaaaa acctccttct cctgccgggc 21720 
tcatacacct acgagtggaa cttcaggaag gatgttaaca tggttctgca gagctcccta 21780 
ggaaatgacc taagggttga cggagccagc attaagtttg atagcatttg cctttacgcc 21840 
accttcttcc ccatggccca caacaccgcc tccacgcttg aggccatgct tagaaacgac 21900 
accaacgacc agtcctttaa cgactatctc tccgccgcca acatgctcta ccctataccc 21960 
gccaacgcta ccaacgtgcc catatccatc ccctcccgca actgggcggc tttccgcggc 22020 
tgggccttca cgcgccttaa gactaaggaa accccatcac tgggctcggg ctacgaccct 220 80 
tattacacct actctggctc tataccctac ctagatggaa ccttttacct caaccacacc 22140 
tttaagaagg tggccattac ctttgactct tctgtcagct ggcctggcaa tgaccgcctg 222 00 
cttaccccca acgagtttga aattaagcgc tcagttgacg gggagggtta caacgttgcc 222 60 
cagtgtaaca tgaccaaaga ctggttcctg gtacaaatgc tagctaacta caacattggc 22320 
taccagggct tctatatccc agagagctac aaggaccgca tgtactcctt ctttagaaac 223 80 
ttccagccca tgagccgtca ggtggtggat gatactaaat acaaggacta ccaacaggtg 22440 
ggcatcctac accaacacaa caactctgga tttgttggct accttgcccc caccatgcgc 22500 
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gaaggacagg cctaccctgc taacttcccc tatccgctta taggcaagac cgcagttgac 22560 
agcattaccc agaaaaagtt tctttgcgat cgcacccttt ggcgcatccc attctccagt 22 620 
aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 22 6 80 
gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 22740 
gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 22800 
accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 22860 
catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 22 920 
atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 22980 
ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 2 3 040 
actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 23100 
gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 23160 
gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 23220 
tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 23280 
ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 23340 
ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 234 0 0 
tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 23460 
gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 23520 
ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 23580 
tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 23640 
acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca 23700 
gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 23760 
gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 23820 
cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 23 8 80 
tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 23 94 0 
ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 24000 
gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 24060 
ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 24120 
acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 24180 
tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 24240 
actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 243 0 0 
ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc 24360 
ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 2442 0 
actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 24480 
aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 24540 
cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 24600 
tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 24660 
tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 2472 0 
gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 24780 
ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 24840 
catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 24900 
tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 24960 
atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 2502 0 
cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 25080 
gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 25140 
cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 252 0 0 
aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 25260 
ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 2532 0 
ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 253 80 
accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 25440 
aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 255 0 0 
tgttgaagca tctgcagcgc cagtgcgcca ttatctgcga cgcgttgcaa gagcgcagcg 25560 
atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 25620 
gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 25680 
accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 25740 
agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 25800 
agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 25860 
ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 25920 
gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 25980 
gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 26040 
gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 26100 
tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 2 6160 
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ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc 2 6220 
tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca 2 6280 
agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga 2 6340 
tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc 2 6400 
ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg 2 6460 
actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt 2 652 0 
gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc 2 6580 
tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg 2 6640 
aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgfctgc 26700 
agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac 26760 
ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact 26820 
gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga 26880 
gcggtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc 26940 
tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg 27000 
gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga 2 70 60 
cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct 27120 
acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc 27180 
acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg 2 7240 
gacggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc 2 7300 
cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag 2 7360 
ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag 2 742 0 
gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct 27480 
tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg 27540 
gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg 27600 
ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt 27660 
aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca 2 772 0 
tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc 27780 
ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat 27840 
tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc 27900 
agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc 27 960 
cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc 2 8 020 
gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag 28 0 80 
gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg 28140 
cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt 2 8200 
cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc 2 82 60 
gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat 28320 
tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc 2 83 80 
ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat 2 8440 
gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc 28500 
tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta 28560 
ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca 2 8 620 
gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg 28680 
gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt 28740 
gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc 2 8800 
ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc 28860 
tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc 2 8 920 
cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa 28980 
ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa 2 9040 
acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta 29100 
ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca 29160 
gggagagctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg 2 922 0 
ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca 29280 
agatctttgt tgccatctct gtgctgagta taataaatac agaaattaaa atatactggg 29340 
gctcctatcg ccatcctgta aacgccaccg tcttcacccg cccaagcaaa ccaaggcgaa 29400 
ccttacctgg tacttttaac atctctccct ctgtgattta caacagtttc aacccagacg 29460 
gagtgagtct acgagagaac ctctccgagc tcagctactc catcagaaaa aacaccaccc 2 952 0 
tccttacctg ccgggaacgt acgagtgcgt caccggccgc tgcaccacac ctaccgcctg 29580 
accgtaaacc agactttttc cggacagacc tcaataactc tgtttaccag aacaggaggt 29640 
gagcttagaa aacccttagg gtattaggcc aaaggcgcag ctactgtggg gtttatgaac 2 9700 
aattcaagca actctacggg ctattctaat tcaggtttct ctagaaatgg acggaattat 29760 
tacagagcag cgcctgctag aaagacgcag ggcagcggcc gagcaacagc gcatgaatca 2 982 0 
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agagctccaa gacatggtta acttgcacca gtgcaaaagg ggtatctttt gtctggtaaa 2 9880 
gcaggccaaa gtcacctacg acagtaatac caccggacac cgccttagct acaagttgcc 2 9940 
aaccaagcgt cagaaattgg tggtcatggt gggagaaaag cccattacca taactcagca 3 0000 
ctcggtagaa accgaaggct gcattcactc accttgtcaa ggacctgagg atctctgcac 3 0 060 
ccttattaag accctgtgcg gtctcaaaga tcttattccc tttaactaat aaaaaaaaat 3 0120 
aataaagcat cacttactta aaatcagtta gcaaatttct gtccagttta ttcagcagca 3 0180 
cctccttgcc ctcctcccag ctctggtatt gcagcttcct cctggctgca aactttctcc 3 0240 
acaatctaaa tggaatgtca gtttcctcct gttcctgtcc atccgcaccc actatcttca 30300 
tgttgttgca gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 3 03 60 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 30420 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 30480 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 3 054 0 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 30600 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 3 0660 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 3 072 0 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 3 07 80 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 3 0840 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 3 09 00 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 3 0960 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 3102 0 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 31080 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3114 0 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 31200 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 31260 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgcfc acagccatag 31320 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 313 80 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3144 0 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 31500 
ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgcagaga 31560 
aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 31620 
cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 316 8 0 
ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 3174 0 
attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 318 0 0 
ttatgcctaa cctatcagct tatccaaaat ctcacggtaa aactgccaaa agtaacattg 31860 
tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 3192 0 
acggtacaca ggaaacagga gacacaactc caagtgcata ctctatgtca ttttcatggg 31980 
actggtctgg ccacaactac attaatgaaa tatttgccac atcctcttac actttttcat 32040 
acattgccca agaataaaga atcgtttgtg ttatgtttca acgtgtttat ttttcaattg 32100 
cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca tagcttatac 32160 
agatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc acctccctcc 3222 0 
caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat catatcatgg 322 80 
gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc caaacgctca 32340 
tcagtgatat taataaactc cccgggcagc tcacttaagt tcatgtcgct gtccagctgc 32400 
tgagccacag gctgctgtcc aacttgcggt tgcttaacgg gcggcgaagg agaagtccac 32460 
gcctacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg ctgcagcagc 3252 0 
gcgcgaataa actgctgccg ccgccgctcc gtcctgcagg aatacaacat ggcagtggtc 32580 
tcctcagcga tgattcgcac cgcccgcagc ataaggcgcc ttgtcctccg ggcacagcag 32640 
cgcaccctga tctcacttaa atcagcacag taactgcagc acagcaccac aatattgttc 3 27 00 
aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac agaacccacg 32760 
tggccatcat accacaagcg caggtagatt aagtggcgac ccctcataaa cacgctggac 3 2 820 
ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca tataaacctc 32880 
tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggccaaaac ctgcccgccg 32940 
gctatacact gcagggaacc gggactggaa caatgacagt ggagagccca ggactcgtaa 33000 
ccatggatca tcatgctcgt catgatatca atgttggcac aacacaggca cacgtgcata 33 060 
cacttcctca ggattacaag ctcctcccgc gttagaacca tatcccaggg aacaacccat 33120 
tcctgaatca gcgtaaatcc cacactgcag ggaagacctc gcacgtaact cacgttgtgc 33180 
attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt agcgcgggtt 33240 
tctgtctcaa aaggaggtag acgatcccta ctgtacggag tgcgccgaga caaccgagat 333 00 
cgtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt tcctgaagca 333 60 
aaaccaggtg cgggcgtgac aaacagatct gcgtctccgg tctcgccgct tagatcgctc 33420 
tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc tggcttcggg 33480 
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ttctatgtaa actccttcat gcgccgctgc cctgataaca tccaccaccg cagaataagc 33540 
cacacccagc caacctacac attcgttctg cgagtcacac acgggaggag cgggaagagc 33600 
tggaagaacc atgttttttt ttttattcca aaagattatc caaaacctca aaatgaagat 33 660 
ctattaagtg aacgcgctcc cctccggtgg cgtggtcaaa ctctacagcc aaagaacaga 3372 0 
taatggcatt tgtaagatgt tgcacaatgg cttccaaaag gcaaacggcc ctcacgtcca 33 78 0 
agtggacgta aaggctaaac ccttcagggt gaatctcctc tataaacatt ccagcacctt 33 840 
caaccatgcc caaataattc tcatctcgcc accttctcaa tatatctcta agcaaatccc 33900 
gaatattaag tccggccatt gtaaaaatct gctccagagc gccctccacc ttcagcctca 33960 
agcagcgaat catgattgca aaaattcagg ttcctcacag acctgtataa gattcaaaag 3402 0 
cggaacatta acaaaaatac cgcgatcccg taggtccctt cgcagggcca gctgaacata 34080 
atcgtgcagg tctgcacgga ccagcgcggc cacttccccg ccaggaacct tgacaaaaga 34140 
acccacactg attatgacac gcatactcgg agctatgcta accagcgtag ccccgatgta 34200 
agctttgttg catgggcggc gatataaaat gcaaggtgct gctcaaaaaa tcaggcaaag 342 60 
cctcgcgcaa aaaagaaagc acatcgtagt catgctcatg cagataaagg caggtaagct 34320 
ccggaaccac cacagaaaaa gacaccattt ttctctcaaa catgtctgcg ggtttctgca 343 80 
taaacacaaa ataaaataac aaaaaaacat ttaaacatta gaagcctgtc ttacaacagg 3444 0 
aaaaacaacc cttataagca taagacggac tacggccatg ccggcgtgac cgtaaaaaaa 34500 
ctggtcaccg tgattaaaaa gcaccaccga cagctcctcg gtcatgtccg gagtcataat 34560 
gtaagactcg gtaaacacat caggttgatt catcggtcag tgctaaaaag cgaccgaaat 3462 0 
agcccggggg aatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta 34680 
taacaaaatt aataggagag aaaaacacat aaacacctga aaaaccctcc tgcctaggca 34740 
aaatagcacc ctcccgctcc agaacaacat acagcgcttc cacagcggca gccataacag 348 0 0 
tcagccttac cagtaaaaaa gaaaacctat taaaaaaaca ccactcgaca cggcaccagc 34860 
tcaatcagtc acagtgtaaa aaagggccaa gtgcagagcg agtatatata ggactaaaaa 34920 
atgacgtaac ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga acctacgccc 34980 
agaaacgaaa gccaaaaaac ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt 35040 
acgtcacttc ccattttaat taagaaaact acaattccca acacatacaa gttactccgc 35100 
cctaaaacct acgtcacccg ccccgttccc acgccccgcg ccacgtcaca aactccaccc 35160 
cctcattatc atattggctt caatccaaaa taaggtatat tattgatgat g 3 5211 



<210> 44 
<211> 33622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid Av3nBg 



<400> 44 

catcatcaat aatatacctt 



ttgtgacgtg 
gatgttgcaa 
gtgtgcgccg 
taaatttggg 
agtgaaatct 
gactttgacc 
acgcgtatct 
gttaggagtc 
cttatgcaat 
aaggagagaa 
tattaggaag 
cagagatatt 
cattggtgtg 
ctgttgggct 
aacccgtcgg 
cggatcggaa 
gatcccccgg 
gtagaagacc 
gtcgttttac 
gcacatcccc 
caacagttgc 



gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
gtttacgtgg 
gaggggacta 
ccctcaggat 
actcttgtag 
aaagcaccgt 
gcaacagacg 
gtatttaagt 
cacctccggc 
cgcggttgag 
cctccgaacg 
aacctctcga 
gctgcaggaa 
ccaaggactt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
agactcgccc 
gggtgtgttt 
atagtagttt 
tcttgcaaca 
gcatgccgat 
ggtctgacat 
gcctagctcg 
cctggccact 
gacaaactct 
gtactccgcc 
gaaaggcgtg 
ttcgatctag 
tccttcagaa 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
agggcgcgcc 
aggcgaaaag 
cgcttttgca 
tggtaacgat 
tggtggaagt 
ggattggacg 
atacaataaa 
ctcttccgca 
tcgcggtctt 
gccgagggac 
taaccagtca 
atggataaag 
ttgctaagtt 
cctggcgtta 
agcgaagagg 
cgctttgcct 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tatttgtcta 
ccgatgtacg 
cggggcttcg 
tagggagggg 
gagttagcaa 
aaggtggtac 
aaccactgaa 
cgccatttga 
tcgctgtctg 
tccagtactc 
ctgagcgagt 
cagtcgctct 
gtccaaaaaa 
ttttgagtga 
cccaacttaa 
cccgcaccga 
ggtttccggc 



ggggtggagt 

gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
gggccgcggg 
ggccagatat 
gttgtacgcg 
gaaatgtagt 
catgccttac 
gatcgtgcct 
ttccgcattg 
ccattcacca 

cgggggccag 

ttggatcgga 
ccgcatcgac 
agaactagtg 
gaagagaaag 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
accagaagcg 
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gtgccggaaa gctggctgga gtgcgatctt 
aactggcaga tgcacggtta cgatgcgccc 
gtcaatccgc cgtttgttcc cacggagaat 
gttgatgaaa gctggctaca ggaaggccag 
gcgtttcatc tgtggtgcaa cgggcgctgg 
tctgaatttg acctgagcgc atttttacgc 
ctgcgttgga gtgacggcag ttatctggaa 
ttccgtgacg tctcgttgct gcataaaccg 
actcgcttta atgatgattt cagccgcgct 
gagttgcgtg actacctacg ggtaacagtt 
agcggcaccg cgcctttcgg cggtgaaatt 
gtcacactac gtctgaacgt cgaaaacccg 
tatcgtgcgg tggttgaact gcacaccgcc 
gatgtcggtt tccgcgaggt gcggattgaa 
ttgctgattc gaggcgttaa ccgtcacgag 
gagcagacga tggtgcagga tatcctgctg 
tgttcgcatt atccgaacca tccgctgtgg 
gtggtggatg aagccaatat tgaaacccac 
gatccgcgct ggctaccggc gatgagcgaa 
aatcacccga gtgtgatcat ctggtcgctg 
gacgcgctgt atcgctggat caaatctgtc 
ggcggagccg acaccacggc caccgatatt 
gaccagccct tcccggctgt gccgaaatgg 
gagacgcgcc cgctgatcct ttgcgaatac 
ttcgctaaat actggcaggc gtttcgtcag 
gactgggtgg atcagtcgct gattaaatat 
ggcggtgatt ttggcgatac gccgaacgat 
gccgaccgca cgccgcatcc agcgctgacg 
ttccgtttat ccgggcaaac catcgaagtg 
aacgagctcc tgcactggat ggtggcgctg 
cctctggatg tcgctccaca aggtaaacag 
gagagcgccg ggcaactctg gctcacagta 
tcagaagccg ggcacatcag cgcctggcag 
acgctccccg ccgcgtccca cgccatcccg 
atcgagctgg gtaataagcg ttggcaattt 
tggattggcg ataaaaaaca actgctgacg 
ctggataacg acattggcgt aagtgaagcg 
cgctggaagg cggcgggcca ttaccaggcc 
acacttgctg atgcggtgct gattacgacc 
ttatttatca gccggaaaac ctaccggatt 
gatgttgaag tggcgagcga tacaccgcat 
gcgcaggtag cagagcgggt aaactggctc 
cgccttactg ccgcctgttt tgaccgctgg 
tacgtcttcc cgagcgaaaa cggtctgcgc 
caccagtggc gcggcgactt ccagttcaac 
gaaaccagcc atcgccatct gctgcacgcg 
ttccatatgg ggattggtgg cgacgactcc 
ctgagcgccg gtcgctacca ttaccagttg 
aagcttatcg ataccgtcga aacttgttta 
atagcatcac aaatttcaca aataaagcat 
ccaaactcat caatgtatct tatcatgtct 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 



cctgaggccg atactgtcgt cgtcccctca 13 80 
atctacacca acgtaaccta tcccattacg 1440 
ccgacgggtt gttactcgct cacatttaat 15 0 0 
acgcgaatta tttttgatgg cgttaactcg 1560 
gtcggttacg gccaggacag tcgtttgccg 162 0 
gccggagaaa accgcctcgc ggtgatggtg 1680 
gatcaggata tgtggcggat gagcggcatt 1740 
actacacaaa tcagcgattt ccatgttgcc 18 0 0 
gtactggagg ctgaagttca gatgtgcggc 186 0 
tctttatggc agggtgaaac gcaggtcgcc 192 0 
atcgatgagc gtggtggtta tgccgatcgc 198 0 
aaactgtgga gcgccgaaat cccgaatctc 2 04 0 
gacggcacgc tgattgaagc agaagcctgc 210 0 
aatggtctgc tgctgctgaa cggcaagccg 2160 
catcatcctc tgcatggtca ggtcatggat 2220 
atgaagcaga acaactttaa cgccgtgcgc 2280 
tacacgctgt gcgaccgcta cggcctgtat 234 0 
ggcatggtgc caatgaatcg tctgaccgat 240 0 
cgcgtaacgc gaatggtgca gcgcgatcgt 24 60 
gggaatgaat caggccacgg cgctaatcac 252 0 
gatccttccc gcccggtgca gtatgaaggc 2 58 0 
atttgcccga tgtacgcgcg cgtggatgaa 2 640 
tccatcaaaa aatggctttc gctacctgga 2700 
gcccacgcga tgggtaacag tcttggcggt 2 76 0 
tatccccgtt tacagggcgg cttcgtctgg 2 82 0 
gatgaaaacg gcaacccgtg gtcggcttac 2 88 0 
cgccagttct gtatgaacgg tctggtcttt 2 94 0 
gaagcaaaac accagcagca gtttttccag 3 00 0 
accagcgaat acctgttccg tcatagcgat 3 06 0 
gatggtaagc cgctggcaag cggtgaagtg 312 0 
ttgattgaac tgcctgaact accgcagccg 318 0 
cgcgtagtgc aaccgaacgc gaccgcatgg 324 0 
cagtggcgtc tggcggaaaa cctcagtgtg 33 0 0 
catctgacca ccagcgaaat ggatttttgc 3360 
aaccgccagt caggctttct ttcacagatg 342 0 
ccgctgcgcg atcagttcac ccgtgcaccg 3480 
acccgcattg accctaacgc ctgggtcgaa 3540 
gaagcagcgt tgttgcagtg cacggcagat 3 60 0 
gctcacgcgt ggcagcatca ggggaaaacc 3 66 0 
gatggtagtg gtcaaatggc gattaccgtt 3 72 0 
ccggcgcgga ttggcctgaa ctgccagctg 3 78 0 
ggattagggc cgcaagaaaa ctatcccgac 3 84 0 
gatctgccat tgtcagacat gtataccccg 3 90 0 
tgcgggacgc gcgaattgaa ttatggccca 3 960 
atcagccgct acagtcaaca gcaactgatg 402 0 
gaagaaggca catggctgaa tatcgacggt 4080 
tggagcccgt cagtatcggc ggaatttcag 4140 
gtctggtgtc aaaaataata atctcgaatc 42 0 0 
ttgcagctta taatggttac aaataaagca 42 60 
ttttttcact gcattctagt tgtggtttgt 4320 
ggatccgacc tcggatctgg aaggtgctga 43 80 
cctgcgagtg tggcggtaaa catattagga 444 0 
agctgaggcc cgatcacttg gtgctggcct 4500 
aagatacaga ttgaggtact gaaatgtgtg 4560 
aaggtggggg tcttatgtag ttttgtatct 4620 
ccaactcgtt tgatggaagc attgtgagct 46 80 
ccggggtgcg tcagaatgtg atgggctcca 4740 
actctactac cttgacctac gagaccgtgt 48 00 
ccgccgcttc agccgctgca gccaccgccc 4 860 
gcccgcttgc aagcagtgca gcttcccgtt 4920 
ttttggcaca attggattct ttgacccggg 4980 
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aacttaatgt cgtttctcag cagctgttgg atctgcgcca gcaggtttct gccctgaagg 5040 
cttcctcccc tcccaatgcg gtttaaaaca taaataaaaa accagactct gtttggattt 510 0 
ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 5160 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 522 0 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 52 80 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 5340 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 54 0 0 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 54 60 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 5520 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 55 8 0 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 5640 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 5700 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 5760 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 5 82 0 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 58 80 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 5940 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccggctgc aactggtagt 60 00 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 6060 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 6120 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 6180 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 6240 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 63 0 0 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 63 60 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 642 0 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 64 8 0 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 6540 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 6600 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 6660 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 6720 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 6780 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 6 84 0 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 6900 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 6960 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 7 02 0 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 7 08 0 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 7140 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 72 0 0 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 72 6 0 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 7320 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 73 8 0 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 7440 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 75 0 0 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 7560 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 7 62 0 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 7680 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 774 0 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 7 80 0 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 7 860 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 792 0 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 7980 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 8 040 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 8100 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 8160 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 822 0 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 82 8 0 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 8340 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 84 0 0 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 846 0 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 852 0 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 858 0 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 8640 
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aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 87 0 0 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 87 60 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 8820 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 8880 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 8940 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 90 00 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 9060 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 912 0 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 918 0 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 924 0 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 93 0 0 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 93 60 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 942 0 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 94 80 
tagatccagg'tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 9540 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 960 0 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 9660 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 972 0 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 9780 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 9 840 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 9900 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 9960 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 10 02 0 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 10080 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 10140 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 10200 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 10260 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 1032 0 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 10380 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 10440 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 105 00 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 10560 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 10 62 0 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 106 80 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 10740 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 10 8 00 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 10860 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 10920 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 10980 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 11040 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 11100 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 11160 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 1122 0 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 112 8 0 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 11340 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 11400 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 11460 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 1152 0 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 115 80 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagac cgtgcaaaag 11640 
gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa gggtatcatg 11700 
gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat gcggttaccg 11760 
cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc ttttggcttc 1182 0 
cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc gcagcgtaag 11880 
cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg agggttattt 11940 
tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accggccgga ctgcggcgaa 120 00 
cgggggtttg cctccccgtc atgcaagacc ccgcttgcaa attcctccgg aaacagggac 12 0 60 
gagccccttt tttgcttttc ccagatgcat ccggtgctgc ggcagatgcg cccccctcct 12120 
cagcagcggc aagagcaaga gcagcggcag acatgcaggg caccctcccc tcctcctacc 12180 
gcgtcaggag gggcgacatc cgcggttgac gcggcagcag atggtgatta cgaacccccg 12240 
cggcgccggg cccggcacta cctggacttg gaggagggcg agggcctggc gcggctagga 123 0 0 
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gcgccctctc 
gtgccgcggc 
cgaaagttcc 
gaggaggact 
gccgccgacc 
agctttaaca 
catctgtggg 
cagctgttcc 
aacatagtag 
gtggtgcagg 
cttagcctgg 
aaggaggtaa 
gacgacctgg 
cgcgagctca 
agcggcgata 
agccgacgcg 
gctggcaacg 
ggcgagtact 
tgcgggcggc 
tcatggaccg 
aggccaaccg 
acgagaaggt 
aggccggcct 
tgcagaccaa 
gcgcgcagca 
agcccgccaa 
taatggtgac 
agaccagtag 
ggctgtgggg 
ccaactcgcg 
gggacacata 
tggacgagca 
cgggcagcct 
cgttgcacag 
gccttaacct 
acatggaacc 
tgcatcgcgc 
ggctaccgcc 
tcctctggga 
tgcaacagcg 
gcttgtccga 
tgatagggtc 
acctaaacaa 
acaacgggat 
acagggacgt 
gtctggtgtg 
gtggcaaccc 
gcatgatgca 
tccccttagt 
tgtggtgagc 
cccgccgttt 
ctctgagttg 
ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 
gaccatgacc 
acagaacggg 

ggggtttgac 

tccagacatc 



ctgagcggta 
agaacctgtt 
acgcagggcg 
ttgagcccga 
tggtaaccgc 
accacgtgcg 
actttgtaag 
ttatagtgca 
agcccgaggg 
agcgcagctt 
gcaagtttta 
agatcgaggg 
gcgtttatcg 
gcgaccgcga 
gagaggccga 
ccctggaggc 
tcggcggcgt 
aagcggtgat 
gctgcagagc 
catcatgtcg 
gctctccgca 
gctggcgatc 
ggtctacgac 
cctggaccgg 
gcagggcaac 
cgtgccgcgg 
tgagacaccg 
acaaggcctg 
ggtgcgggct 
cctgttgctg 
cctaggtcac 
tactttccag 
ggaggcaacc 
tttaaacagc 
gatgcgcgac 
gggcatgtat 
ggccgccgtg 
ccctggtttc 
cgacatagac 
cgagcaggca 
tctaggcgct 
tcttaccagc 
ctcgctgctg 
agagagccta 
gccaggcccg 
ggaggacgat 
gtttgcgcac 
aaataaaaaa 
atgcggcgcg 
gcggcgccag 
gtgcctccgc 
gcacccctat 
tccctgaact 
tacagcccgg 
ggcgacctga 
aataagttta 
ctgaaatacg 
atagacctta 
gttctggaaa 
cccgtcactg 
attttgctgc 



cccaagggtg 
tcgcgaccgc 
cgagctgcgg 
cgcgcgaacc 
atacgagcag 
tacgcttgtg 
cgcgctggag 
gcacagcagg 
ccgctggctg 
gagcctggct 
cgcccgcaag 
gttctacatg 
caacgagcgc 
gctgatgcac 
gtcctacttt 
agctggggcc 
ggaggaatat 
gtttctgatc 
cagccgtccg 
ctgactgcgc 
attctggaag 
gtaaacgcgc 
gcgctgcttc 
ctggtggggg 
ctgggctcca 
ggacaggagg 
caaagtgagg 
cagaccgtaa 
cccacaggcg 
ctgctaatag 
ttgctgacac 
gagattacaa 
ctaaactacc 
gaggaggagc 
ggggtaacgc 
gcctcaaacc 
aaccccgagt 
tacaccgggg 
gacagcgtgt 
gaggcggcgc 
gcggccccgc 
actcgcacca 
cagccgcagc 
gtggacaaga 
cgcccgccca 
gactcggcag 
cttcgcccca 
ctcaccaagg 
cggcgatgta 
tggcggcggc 
ggtacctgcg 
tcgacaccac 
accagaacga 
gggaggcaag 
aaaccatcct 
aggcgcgggt 
agtgggtgga 
tgaacaacgc 
gcgacatcgg 
gtcttgtcat 
caggatgcgg 



cagctgaagc 
gagggagagg 
catggcctga 
gggattagtc 
acggtgaacc 
gcgcgcgagg 
caaaacccaa 
gacaacgagg 
ctcgatttga 
gacaaggtgg 
atataccata 
cgcatggcgc 
atccacaagg 
agcctgcaaa 
gacgcgggcg 
ggacctgggc 
gacgaggacg 
agatgatgca 
gccttaactc 
gcaatcctga 
cggtggtccc 
tggccgaaaa 
agcgcgtggc 
atgtgcgcga 
tggttgcact 
actacaccaa 
tgtaccagtc 
acctgagcca 
accgcgcgac 
cgcccttcac 
tgtaccgcga 
gtgtcagccg 
tgctgaccaa 
gcattttgcg 
ccagcgtggc 
ggccgtttat 
atttcaccaa 
gattcgaggt 
tttccccgca 
tgcgaaagga 
ggtcagatgc 
cccgcccgcg 
gcgaaaaaaa 
tgagtagatg 
cccgtcgtca 
acgacagcag 
ggctggggag 
ccatggcacc 
tgaggaaggt 
gctgggttct 
gcctaccggg 
ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 
gttcacgctg 
gatcgtggag 
ggtaaagttt 
gcctggggta 
ggtggacttc 



gtgatacgcg 
agcccgagga 
atcgcgagcg 
ccgcgcgcgc 
aggagattaa 
aggtggctat 
atagcaagcc 
cattcaggga 
taaacatcct 
ccgccatcaa 
ccccttacgt 
tgaaggtgct 
ccgtgagcgt 
gggccctggc 
ctgacctgcg 
tggcggtggc 
atgagtacga 
agacgbaacg 
cacggacgac 
cgcgttccgg 
ggcgcgcgca 
cagggccatc 
tcgttacaac 
ggccgtggcg 
aaacgccttc 
ctttgtgagc 
tgggccagac 
ggctttcaaa 
cgtgtctagc 
ggacagtggc 
ggccataggt 
cgcgctgggg 
ccggcggcag 
ctacgtgcag 
gctggacatg 
caaccgccta 
tgccatcttg 
gcccgagggt 
accgcagacc 
aagcttccgc 
tagtagccca 
cctgctgggc 
cctgcctccg 
gaagacgtac 
aaggcacgac 
cgtcctggat 
aatgttttaa 
gagcgttggt 
cctcctccct 
cccttcgatg 
gggagaaaca 
ctggtggaca 
tttctgacca 
atcaatcttg 
atgccaaatg 
cgcttgccta 
cccgagggca 
cactacttga 
gacacccgca 
tatacaaacg 
acccacagcc 



tgaggcgtac 
gatgcgggat 
gttgctgcgc 
acacgtggcg 
ctttcaaaaa 
aggactgatg 
gctcatggcg 
tgcgctgcta 
gcagagcata 
ctattccatg 
tcccatagac 
taccttgagc 
gagccggcgg 
tggcacgggc 
ctgggcccca 
acccgcgcgc 
gccagaggac 
gacccggcgg 
tggcgccagg 
cagcagccgc 
aaccccacgc 
cggcccgacg 
agcggcaacg 
cagcgtgagc 
ctgagtacac 
gcactgcggc 
tattttttcc 
aacttgcagg 
ttgctgacgc 
agcgtgtccc 
caggcgcatg 
caggaggaca 
aagatcccct 
cagagcgtga 
accgcgcgca 
atggactact 
aacccgcact 
aacgatggat 
ctgctagagt 
aggccaagca 
tttccaagct 
gaggaggagt 
gcatttccca 
gcgcaggagc 
cgtcagcggg 
ttgggaggga 
aaaaaaaaaa 
tttcttgtat 
cctacgagag 
ctcccctgga 
gcatccgtta 
acaagtcaac 
cggtcattca 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 
actactccga 
aagtgggcag 
acttcagact 
aagccttcca 
gcctgagcaa 



12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
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cttgttgggc atccgcaagc ggcaaccctt ccaggagggc tttaggatca cctacgatga 1602 0 
tctggagggt ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa 16080 
agatgacacc gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc 1614 0 
ggaagagaac tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatca 16200 
tgccattcgc ggcgacacct ttgccacacg ggctgaggag aagcgcgctg aggccgaagc 16260 
agcggccgaa gctgccgccc ccgctgcgca acccgaggtc gagaagcctc agaagaaacc 1632 0 
ggtgatcaaa cccctgacag aggacagcaa gaaacgcagt tacaacctaa taagcaatga 16380 
cagcaccttc acccagtacc gcagctggta ccttgcatac aactacggcg accctcagac 1644 0 
cggaatccgc tcatggaccc tgctttgcac tcctgacgta acctgcggct cggagcaggt 16500 
ctactggtcg ttgccagaca tgatgcaaga ccccgtgacc ttccgctcca cgcgccagat 16560 
cagcaacttt ccggtggtgg gcgccgagct gttgcccgtg cactccaaga gcttctacaa 16620 
cgaccaggcc gtctactccc aactcatccg ccagtttacc tctctgaccc acgtgttcaa 16680 
tcgctttccc gagaaccaga ttttggcgcg cccgccagcc cccaccatca ccaccgtcag 16740 
tgaaaacgtt cctgctctca cagatcacgg gacgctaccg ctgcgcaaca gcatcggagg 168 0 0 
agtccagcga gtgaccatta ctgacgccag acgccgcacc tgcccctacg tttacaaggc 16860 
cctgggcata gtctcgccgc gcgtcctatc gagccgcact ttttgagcaa gcatgtccat 1692 0 
ccttatatcg cccagcaata acacaggctg gggcctgcgc ttcccaagca agatgtttgg 16980 
cggggccaag aagcgctccg accaacaccc agtgcgcgtg cgcgggcact accgcgcgcc 17040 
ctggggcgcg cacaaacgcg gccgcactgg gcgcaccacc gtcgatgacg ccatcgacgc 1710 0 
ggtggtggag gaggcgcgca actacacgcc cacgccgcca ccagtgtcca cagtggacgc 17160 
ggccattcag accgtggtgc gcggagcccg gcgctatgct aaaatgaaga gacggcggag 1722 0 
gcgcgtagca cgtcgccacc gccgccgacc cggcactgcc gcccaacgcg cggcggcggc 172 8 0 
cctgcttaac cgcgcacgtc gcaccggccg acgggcggcc atgcgggccg ctcgaaggct 17340 
ggccgcgggt attgtcactg tgccccccag gtccaggcga cgagcggccg ccgcagcagc 17400 
cgcggccatt agtgctatga ctcagggtcg caggggcaac gtgtattggg tgcgcgactc 174 6 0 
ggttagcggc ctgcgcgtgc ccgtgcgcac ccgccccccg cgcaactaga ttgcaagaaa 1752 0 
aaactactta gactcgtact gttgtatgta tccagcggcg gcggcgcgca acgaagctat 175 80 
gtccaagcgc aaaatcaaag aagagatgct ccaggtcatc gcgccggaga tctatggccc 17640 
cccgaagaag gaagagcagg attacaagcc ccgaaagcta aagcgggtca aaaagaaaaa 17700 
gaaagatgat gatgatgaac ttgacgacga ggtggaactg ctgcacgcta ccgcgcccag 17760 
gcgacgggta cagtggaaag gtcgacgcgt aaaacgtgtt ttgcgacccg gcaccaccgt 1782 0 
agtctttacg cccggtgagc gctccacccg cacctacaag cgcgtgtatg atgaggtgta 178 80 
cggcgacgag gacctgcttg agcaggccaa cgagcgcctc ggggagtttg cctacggaaa 17940 
gcggcataag gacatgctgg cgttgccgct ggacgagggc aacccaacac ctagcctaaa 180 00 
gcccgtaaca ctgcagcagg tgctgcccgc gcttgcaccg tccgaagaaa agcgcggcct 180 60 
aaagcgcgag tctggtgact tggcacccac cgtgcagctg atggtaccca agcgccagcg 1812 0 
actggaagat gtcttggaaa aaatgaccgt ggaacctggg ctggagcccg aggtccgcgt 18180 
gcggccaatc aagcaggtgg cgccgggact gggcgtgcag accgtggacg ttcagatacc 18240 
cactaccagt agcaccagta ttgccaccgc cacagagggc atggagacac aaacgtcccc 183 00 
ggttgcctca gcggtggcgg atgccgcggt gcaggcggtc gctgcggccg cgtccaagac 183 60 
ctctacggag gtgcaaacgg acccgtggat gtttcgcgtt tcagcccccc ggcgcccgcg 1842 0 
cggttcgagg aagtacggcg ccgccagcgc gctactgccc gaatatgccc tacatccttc 18480 
cattgcgcct acccccggct atcgtggcta cacctaccgc cccagaagac gagcaactac 18540 
ccgacgccga accaccactg gaacccgccg ccgccgtcgc cgtcgccagc ccgtgctggc 18600 
cccgatttcc gtgcgcaggg tggctcgcga aggaggcagg accctggtgc tgccaacagc 186 60 
gcgctaccac cccagcatcg tttaaaagcc ggtctttgtg gttcttgcag atatggccct 18720 
cacctgccgc ctccgtttcc cggtgccggg attccgagga agaatgcacc gtaggagggg 18780 
catggccggc cacggcctga cgggcggcat gcgtcgtgcg caccaccggc ggcggcgcgc 18840 
gtcgcaccgt cgcatgcgcg gcggtatcct gcccctcctt attccactga tcgccgcggc 18900 
gattggcgcc gtgcccggaa ttgcatccgt ggccttgcag gcgcagagac actgattaaa 18960 
aacaagttgc atgtggaaaa atcaaaataa aaagtctgga ctctcacgct cgcttggtcc 1902 0 
tgtaactatt ttgtagaatg gaagacatca actttgcgtc tctggccccg cgacacggct 19080 
cgcgcccgtt catgggaaac tggcaagata tcggcaccag caatatgagc ggtggcgcct 19140 
tcagctgggg ctcgctgtgg agcggcatta aaaatttcgg ttccaccgtt aagaactatg 19200 
gcagcaaggc ctggaacagc agcacaggcc agatgctgag ggataagttg aaagagcaaa 19260 
atttccaaca aaaggtggta gatggcctgg cctctggcat tagcggggtg gtggacctgg 19320 
ccaaccaggc agtgcaaaat aagattaaca gtaagcttga tccccgccct cccgtagagg 193 80 
agcctccacc ggccgtggag acagtgtctc cagaggggcg tggcgaaaag cgtccgcgcc 1944 0 
ccgacaggga agaaactctg gtgacgcaaa tagacgagcc tccctcgtac gaggaggcac 19500 
taaagcaagg cctgcccacc acccgtccca tcgcgcccat ggctaccgga gtgctgggcc 19560 
agcacacacc cgtaacgctg gacctgcctc cccccgccga cacccagcag aaacctgtgc 1962 0 
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tgccaggccc gaccgccgtt gttgtaaccc gtcctagccg cgcgtccctg cgccgcgccg 1968 0 
ccagcggtcc gcgatcgttg cggcccgtag ccagtggcaa ctggcaaagc acactgaaca 1974 0 
gcatcgtggg tctgggggtg caatccctga agcgccgacg atgcttctga atagctaacg 19 80 0 
tgtcgtatgt gtgtcatgta tgcgtccatg tcgccgccag aggagctgct gagccgccgc 19860 
gcgcccgctt tccaagatgg ctaccccttc gatgatgccg cagtggtctt acatgcacat 1992 0 
ctcgggccag gacgcctcgg agtacctgag ccccgggctg gtgcagtttg cccgcgccac 19980 
cgagacgtac ttcagcctga ataacaagtt tagaaacccc acggtggcgc ctacgcacga 20040 
cgtgaccaca gaccggtccc agcgtttgac gctgcggttc atccctgtgg accgtgagga 20100 
tactgcgtac tcgtacaagg cgcggttcac cctagctgtg ggtgataacc gtgtgctgga 2 0160 
catggcttcc acgtactttg acatccgcgg cgtgctggac aggggcccta cttttaagcc 20220 
ctactctggc actgcctaca acgccctggc tcccaagggt gccccaaatc cttgcgaatg 2 0280 
ggatgaagct gctactgctc ttgaaataaa cctagaagaa gaggacgatg acaacgaaga 2 034 0 
cgaagtagac gagcaagctg agcagcaaaa aactcacgta tttgggcagg cgccttattc 2 0400 
tggtataaat attacaaagg agggtattca aataggtgtc gaaggtcaaa cacctaaata 2 0460 
tgccgataaa acatttcaac ctgaacctca aataggagaa tctcagtggt acgaaactga 2 052 0 
aattaatcat gcagctggga gagtccttaa aaagactacc ccaatgaaac catgttacgg 20580 
ttcatatgca aaacccacaa atgaaaatgg agggcaaggc attcttgtaa agcaacaaaa 20640 
tggaaagcta gaaagtcaag tggaaatgca atttttctca actactgagg cgaccgcagg 20700 
caatggtgat aacttgactc ctaaagtggt attgtacagt gaagatgtag atatagaaac 2 0760 
cccagacact catatttctt acatgcccac tattaaggaa ggtaactcac gagaactaat 20820 
gggccaacaa tctatgccca acaggcctaa ttacattgct tttagggaca attttattgg 20880 
tctaatgtat tacaacagca cgggtaatat gggtgttctg gcgggccaag catcgcagtt 2 0 940 
gaatgctgtt gtagatttgc aagacagaaa cacagagctt tcataccagc ttttgcttga 21000 
ttccattggt gatagaacca ggtacttttc tatgtggaat caggctgttg acagctatga 21060 
tccagatgtt agaattattg aaaatcatgg aactgaagat gaacttccaa attactgctt 2112 0 
tccactggga ggtgtgatta atacagagac tcttaccaag gtaaaaccta aaacaggtca 21180 
ggaaaatgga tgggaaaaag atgctacaga attttcagat aaaaatgaaa taagagttgg 2124 0 
aaataatttt gccatggaaa tcaatctaaa tgccaacctg tggagaaatt tcctgtactc 213 00 
caacatagcg ctgtatttgc ccgacaagct aaagtacagt ccttccaacg taaaaatttc 213 60 
tgataaccca aacacctacg actacatgaa caagcgagtg gtggctcccg ggttagtgga 21420 
ctgctacatt aaccttggag cacgctggtc ccttgactat atggacaacg tcaacccatt 21480 
taaccaccac cgcaatgctg gcctgcgcta ccgctcaatg ttgctgggca atggtcgcta 2154 0 
tgtgcccttc cacat'ccagg tgcctcagaa gttctttgcc attaaaaacc tccttctcct 21600 
gccgggctca tacacctacg agtggaactt caggaaggat gttaacatgg ttctgcagag 21660 
ctccctagga aatgacctaa gggttgacgg agccagcatt aagtttgata gcatttgcct 2172 0 
ttacgccacc ttcttcccca tggcccacaa caccgcctcc acgcttgagg ccatgcttag 21780 
aaacgacacc aacgaccagt cctttaacga ctatctctcc gccgccaaca tgctctaccc 21840 
tatacccgcc aacgctacca acgtgcccat atccatcccc tcccgcaact gggcggcttt 21900 
ccgcggctgg gccttcacgc gccttaagac taaggaaacc ccatcactgg gctcgggcta 21960 
cgacccttat tacacctact ctggctctat accctaccta gatggaacct tttacctcaa 22020 
ccacaccttt aagaaggtgg ccattacctt tgactcttct gtcagctggc ctggcaatga 22080 
ccgcctgctt acccccaacg agtttgaaat taagcgctca gttgacgggg agggttacaa 22140 
cgttgcccag tgtaacatga ccaaagactg gttcctggta caaatgctag ctaactacaa 222 00 
cattggctac cagggcttct atatcccaga gagctacaag gaccgcatgt actccttctt 2 2260 
tagaaacttc cagcccatga gccgtcaggt ggtggatgat actaaataca aggactacca 2 232 0 
acaggtgggc atcctacacc aacacaacaa ctctggattt gttggctacc ttgcccccac 223 80 
catgcgcgaa ggacaggcct accctgctaa cttcccctat ccgcttatag gcaagaccgc 22440 
agttgacagc attacccaga aaaagtttct ttgcgatcgc accctttggc gcatcccatt 22500 
ctccagtaac tttatgtcca tgggcgcact cacagacctg ggccaaaacc ttctctacgc 22560 
caactccgcc cacgcgctag acatgacttt tgaggtggat cccatggacg agcccaccct 22 62 0 
tctttatgtt ttgtttgaag tctttgacgt ggtccgtgtg caccggccgc accgcggcgt 2 2680 
catcgaaacc gtgtacctgc gcacgccctt ctcggccggc aacgccacaa cataaagaag 22740 
caagcaacat caacaacagc tgccgccatg ggctccagtg agcaggaact gaaagccatt 22 8 00 
gtcaaagatc ttggttgtgg gccatatttt ttgggcacct atgacaagcg ctttccaggc 22860 
tttgtttctc cacacaagct cgcctgcgcc atagtcaata cggccggtcg cgagactggg 22920 
ggcgtacact ggatggcctt tgcctggaac ccgcactcaa aaacatgcta cctctttgag 22980 
ccctttggct tttctgacca gcgactcaag caggtttacc agtttgagta cgagtcactc 2 3 040 
ctgcgccgta gcgccattgc ttcttccccc gaccgctgta taacgctgga aaagtccacc 23100 
caaagcgtac aggggcccaa ctcggccgcc tgtggactat tctgctgcat gtttctccac 23160 
gcctttgcca actggcccca aactcccatg gatcacaacc ccaccatgaa ccttattacc 2322 0 
ggggtaccca actccatgct caacagtccc caggtacagc ccaccctgcg tcgcaaccag 232 80 
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gaacagctct acagcttcct ggagcgccac tcgccctact tccgcagcca cagtgcgcag 23340 
attaggagcg ccacttcttt ttgtcacttg aaaaacatgt aaaaataatg tactagagac 23400 
actttcaata aaggcaaatg cttttatttg tacactctcg ggtgattatt tacccccacc 23460 
cttgccgtct gcgccgtttg gggaggcggc ggcgacgggg acggggacga cacgtcctcc 2 3520 
atggttgggg gacgtcgcgc cgcaccgcgt ccgcgctcgg gggtggtttc gcgctgctcc 23580 
tcttcccgac tggccatttc cttctcctat aggcagaaaa agatcatgga gtcagtcgag 23 64 0 
aagaaggaca gcctaaccgc cccctctgag ttcgccacca ccgcctccac cgatgccgcc 23700 
aacgcgccta ccaccttccc cgtcgaggca cccccgcttg aggaggagga agtgattatc 23760 
gagcaggacc caggttttgt aagcgaagac gacgaggacc gctcagtacc aacagaggat 23 82 0 
aaaaagcaag accaggacaa cgcagaggca aacgaggaac aagtcgggcg gggggacgaa 23 88 0 
aggcatggcg actacctaga tgtgggagac gacgtgctgt tgaagcatct gcagcgccag 23 94 0 
tgcgccatta tctgcgacgc gttgcaagag cgcagcgatg tgcccctcgc catagcggat 24000 
gtcagccttg cctacgaacg ccacctattc tcaccgcgcg taccccccaa acgccaagaa 24060 
aacggcacat gcgagcccaa cccgcgcctc aacttctacc ccgtatttgc cgtgccagag 2412 0 
gtgcttgcca cctatcacat ctttttccaa aactgcaaga tacccctatc ctgccgtgcc 24180 
aaccgcagcc gagcggacaa gcagctggcc ttgcggcagg gcgctgtcat acctgatatc 24240 
gcctcgctca acgaagtgcc aaaaatcttt gagggtcttg gacgcgacga gaagcgcgcg 24300 
gcaaacgctc tgcaacagga aaacagcgaa aatgaaagtc actctggagt gttggtggaa 2436 0 
ctcgagggtg acaacgcgcg cctagccgta ctaaaacgca gcatcgaggt cacccacttt 2442 0 
gcctacccgg cacttaacct accccccaag gtcatgagca cagtcatgag tgagctgatc 24480 
gtgcgccgtg cgcagcccct ggagagggat gcaaatttgc aagaacaaac agaggagggc 2454 0 
ctacccgcag ttggcgacga gcagctagcg cgctggcttc aaacgcgcga gcctgccgac 24600 
ttggaggagc gacgcaaact aatgatggcc gcagtgctcg ttaccgtgga gcttgagtgc 24660 
atgcagcggt tctttgctga cccggagatg cagcgcaagc tagaggaaac attgcactac 24720 
acctttcgac agggctacgt acgccaggcc tgcaagatct ccaacgtgga gctctgcaac 24780 
ctggtctcct accttggaat tttgcacgaa aaccgccttg ggcaaaacgt gcttcattcc 24840 
acgctcaagg gcgaggcgcg ccgcgactac gtccgcgact gcgtttactt atttctatgc 249 0 0 
tacacctggc agacggccat gggcgtttgg cagcagtgct tggaggagtg caacctcaag 24 960 
gagctgcaga aactgctaaa gcaaaacttg aaggacctat ggacggcctt caacgagcgc 2 5 020 
tccgtggccg cgcacctggc ggacatcatt ttccccgaac gcctgcttaa aaccctgcaa 25080 
cagggtctgc cagacttcac cagtcaaagc atgttgcaga actttaggaa ctttatccta 25140 
gagcgctcag gaatcttgcc cgccacctgc tgtgcacttc ctagcgactt tgtgcccatt 252 00 
aagtaccgcg aatgccctcc gccgctttgg ggccactgct accttctgca gctagccaac 25260 
taccttgcct accactctga cataatggaa gacgtgagcg gtgacggtct actggagtgt 25320 
cactgtcgct gcaacctatg caccccgcac cgctccctgg tttgcaattc gcagctgctt 253 80 
aacgaaagtc aaattatcgg tacctttgag ctgcagggtc cctcgcctga cgaaaagtcc 25440 
gcggctccgg ggttgaaact cactccgggg ctgtggacgt cggcttacct tcgcaaattt 25500 
gtacctgagg actaccacgc ccacgagatt aggttctacg aagaccaatc ccgcccgcca 25560 
aatgcggagc ttaccgcctg cgtcattacc cagggccaca ttcttggcca attgcaagcc 25620 
atcaacaaag cccgccaaga gtttctgcta cgaaagggac ggggggttta cttggacccc 25680 
cagtccggcg aggagctcaa cccaatcccc ccgccgccgc agccctatca gcagcagccg 25740 
cgggcccttg cttcccagga tggcacccaa aaagaagctg cagctgccgc cgccacccac 25800 
ggacgaggag gaatactggg acagtcaggc agaggaggtt ttggacgagg aggaggagga 2 58 60 
catgatggaa gactgggaga gcctagacga ggaagcttcc gaggtcgaag aggtgtcaga 2 5 920 
cgaaacaccg tcaccctcgg tcgcattccc ctcgccggcg ccccagaaat cggcaaccgg 25980 
ttccagcatg gctacaacct ccgctcctca ggcgccgccg gcactgcccg ttcgccgacc 26040 
caaccgtaga tgggacacca ctggaaccag ggccggtaag tccaagcagc cgccgccgtt 26100 
agcccaagag caacaacagc gccaaggcta ccgctcatgg cgcgggcaca agaacgccat 26160 
agttgcttgc ttgcaagact gtgggggcaa catctccttc gcccgccgct ttcttctcta 2 6220 
ccatcacggc gtggccttcc cccgtaacat cctgcattac taccgtcatc tctacagccc 26280 
atactgcacc ggcggcagcg gcagcggcag caacagcagc ggccacacag aagcaaaggc 26340 
gaccggatag caagactctg acaaagccca agaaatccac agcggcggca gcagcaggag 2 6400 
gaggagcgct gcgtctggcg cccaacgaac ccgtatcgac ccgcgagctt agaaacagga 26460 
tttttcccac tctgtatgct atatttcaac agagcagggg ccaagaacaa gagctgaaaa 2 652 0 
taaaaaacag gtctctgcga tccctcaccc gcagctgcct gtatcacaaa agcgaagatc 26580 
agcttcggcg cacgctggaa gacgcggagg ctctcttcag taaatactgc gcgctgactc 26640 
ttaaggacta gtttcgcgcc ctttctcaaa tttaagcgcg aaaactacgt catctccagc 26700 
ggccacaccc ggcgccagca cctgtcgtca gcgccattat gagcaaggaa attcccacgc 26760 
cctacatgtg gagttaccag ccacaaatgg gacttgcggc tggagctgcc caagactact 26820 
caacccgaat aaactacatg agcgcgggac cccacatgat atcccgggtc aacggaatcc 26880 
gcgcccaccg aaaccgaatt ctcttggaac aggcggctat taccaccaca cctcgtaata 2 6940 
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accttaatcc ccgtagttgg cccgctgccc 
ctgtggtact tcccagagac gcccaggccg 
ttgcgggcgg ctttcgtcac agggtgcggt 
tcagagggcg aggtattcag ctcaacgacg 
cggacgggac atttcagatc ggcggcgccg 
tcctaactct gcagacctcg tcctctgagc 
ttattgagga gtttgtgcca tcggtctact 
atccggatca atttattcct aactttgacg 
gaatgttaag tggagaggca gagcaactgc 
acaagtgctt tgcccgcgac tccggtgagt 
tcgagggccc ggcgcacggc gtccggctta 
ttcgggagtt tacccagcgc cccctgctag 
ctgtgatttg caactgtcct aaccttggat 
ctgagtataa taaatacaga aattaaaata 
gccaccgtct tcacccgccc aagcaaacca 
tctccctctg tgatttacaa cagtttcaac 
tccgagctca gctactccat cagaaaaaac 
agtgcgtcac cggccgctgc accacaccta 
acagacctca ataactctgt ttaccagaac 
ttaggccaaa ggcgcagcta ctgtggggtt 
ttctaattca ggtttctcta gaaatggacg 
gacgcagggc agcggccgag caacagcgca 
tgcaccagtg caaaaggggt atcttttgtc 
gtaataccac cggacaccgc cttagctaca 
tcatggtggg agaaaagccc attaccataa 
ttcactcacc ttgtcaagga cctgaggatc 
tcaaagatct tattcccttt aactaataaa 
tcagttagca aatttctgtc cagtttattc 
tggtattgca gcttcctcct ggctgcaaac 
tcctcctgtt cctgtccatc cgcacccact 
agaccgtctg aagatacctt caaccccgtg 
actgtgcctt ttcttactcc tccctttgta 
ggggtactct ctttgcgcct atccgaacct 
aaaatgggca acggcctctc tctggacgag 
actgtgagcc cacctctcaa aaaaaccaag 
ctcacagtta cctcagaagc cctaactgtg 
aacacactca ccatgcaatc acaggccccg 
gccacccaag gacccctcac agtgtcagaa 
ctcaccacca ccgatagcag tacccttact 
actggtagct tgggcattga cttgaaagag 
ctaaagtacg gggctccttt gcatgtaaca 
ggtccaggtg tgactattaa taatacttcc 
tttgattcac aaggcaatat gcaacttaat 
aacagacgcc ttatacttga tgttagttat 
agactaggac agggccctct ttttataaac 
aaaggccttt acttgtttac agcttcaaac 
actgccaagg ggttgatgtt tgacgctaca 
gaatttggtt cacctaatgc accaaacaca 
ctagaatttg attcaaacaa ggctatggtt 
agcacaggtg ccattacagt aggaaacaaa 
ccagctccat ctcctaactg tagactaaat 
ttaacaaaat gtggcagtca aatacttgct 
ttggctccaa tatctggaac agttcaaagt 
ggagtgctac taaacaattc cttcctggac 
cttactgaag gcacagccta tacaaacgct 
ccaaaatctc acggtaaaac tgccaaaagt 
gacaaaacta aacctgtaac actaaccatt 
acaactccaa gtgcatactc tatgtcattt 
aatgaaatat ttgccacatc ctcttacact 
gtttgtgtta tgtttcaacg tgtttatttt 
attcagtagt atagccccac caccacatag 



tggtgtacca ggaaagtccc gctcccacca 27000 
aagttcagat gactaactca ggggcgcagc 2 70 60 
cgcccgggca gggtataact cacctgacaa 2 712 0 
agtcggtgag ctcctcgctt ggtctccgtc 27180 
gccgtccttc attcacgcct cgtcaggcaa 27240 
cgcgctctgg aggcattgga actctgcaat 2 7300 
ttaacccctt ctcgggacct cccggccact 273 60 
cggtaaagga ctcggcggac ggctacgact 27420 
gcctgaaaca cctggtccac tgtcgccgcc 274 80 
tttgctactt tgaattgccc gaggatcata 27540 
ccgcccaggg agagcttgcc cgtagcctga 27600 
ttgagcggga caggggaccc tgtgttctca 27660 
tacatcaaga tctttgttgc catctctgtg 27720 
tactggggct cctatcgcca tcctgtaaac 27780 
aggcgaacct tacctggtac ttttaacatc 27840 
ccagacggag tgagtctacg agagaacctc 2 79 00 
accaccctcc ttacctgccg ggaacgtacg 27960 
ccgcctgacc gtaaaccaga ctttttccgg 28020 
aggaggtgag cttagaaaac ccttagggta 2 80 80 
tatgaacaat tcaagcaact ctacgggcta 28140 
gaattattac agagcagcgc ctgctagaaa 2 8200 
tgaatcaaga gctccaagac atggttaact 2 8260 
tggtaaagca ggccaaagtc acctacgaca 2 832 0 
agttgccaac caagcgtcag aaattggtgg 2 8380 
ctcagcactc ggtagaaacc gaaggctgca 2 844 0 
tctgcaccct tattaagacc ctgtgcggtc 28500 
aaaaaataat aaagcatcac ttacttaaaa 2 85 60 
agcagcacct ccttgccctc ctcccagctc 28620 
tttctccaca atctaaatgg aatgtcagtt 2 8680 
atcttcatgt tgttgcagat gaagcgcgca 2 8740 
tatccatatg acacggaaac cggtcctcca 2 8800 
tcccccaatg ggtttcaaga gagtccccct 2 8860 
ctagttacct ccaatggcat gcttgcgctc 2 8920 
gccggcaacc ttacctccca aaatgtaacc 2 8980 
tcaaacataa acctggaaat atctgcaccc 2 9040 
gctgccgccg cacctctaat ggtcgcgggc 29100 
ctaaccgtgc acgactccaa acttagcatt 29160 
ggaaagctag ccctgcaaac atcaggcccc 29220 
atcactgcct caccccctct aactactgcc 292 80 
cccatttata cacaaaatgg aaaactagga 29340 
gacgacctaa acactttgac cgtagcaact 29400 
ttgcaaacta aagttactgg agccttgggt 2 94 60 
gtagcaggag gactaaggat tgattctcaa 2 952 0 
ccgtttgatg ctcaaaacca actaaatcta 29580 
tcagcccaca acttggatat taactacaac 29640 
aattccaaaa agcttgaggt taacctaagc 2 9700 
gccatagcca ttaatgcagg agatgggctt 2 9760 
aatcccctca aaacaaaaat tggccatggc 2 9820 
cctaaactag gaactggcct tagttttgac 29880 
aataatgata agctaacttt gtggaccaca 2 9940 
gcagagaaag atgctaaact cactttggtc 3 00 00 
acagtttcag ttttggctgt taaaggcagt 30060 
gctcatctta ttataagatt tgacgaaaat 3 012 0 
ccagaatatt ggaactttag aaatggagat 3 0180 
gttggattta tgcctaacct atcagcttat 3 0240 
aacattgtca gtcaagttta cttaaacgga 3 03 00 
acactaaacg gtacacagga aacaggagac 3 03 60 
tcatgggact ggtctggcca caactacatt 3 042 0 
ttttcataca ttgcccaaga ataaagaatc 3 0480 
tcaattgcag aaaatttcaa gtcatttttc 3 0540 
cttatacaga tcaccgtacc ttaatcaaac 30600 
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tcacagaacc ctagtattca acctgccacc tccctcccaa cacacagagt acacagtcct 3 0 660 
ttctccccgg ctggccttaa aaagcatcat atcatgggta acagacatat tcttaggtgt 30720 
tatattccac acggtttcct gtcgagccaa acgctcatca gtgatattaa taaactcccc 30780 
gggcagctca cttaagttca tgtcgctgtc cagctgctga gccacaggct gctgtccaac 30 840 
ttgcggttgc ttaacgggcg gcgaaggaga agtccacgcc tacatggggg tagagtcata 3 0 900 
atcgtgcatc aggatagggc ggtggtgctg cagcagcgcg cgaataaact gctgccgccg 3 0 960 
ccgctccgtc ctgcaggaat acaacatggc agtggtctcc tcagcgatga ttcgcaccgc 31020 
ccgcagcata aggcgccttg tcctccgggc acagcagcgc accctgatct cacttaaatc 3108 0 
agcacagtaa ctgcagcaca gcaccacaat attgttcaaa atcccacagt gcaaggcgct 3114 0 
gtatccaaag ctcatggcgg ggaccacaga acccacgtgg ccatcatacc acaagcgcag 312 0 0 
gtagattaag tggcgacccc tcataaacac gctggacata aacattacct cttttggcat 31260 
gttgtaattc accacctccc ggtaccatat aaacctctga ttaaacatgg cgccatccac 3132 0 
caccatccta aaccagctgg ccaaaacctg cccgccggct atacactgca gggaaccggg 313 8 0 
actggaacaa tgacagtgga gagcccagga ctcgtaacca tggatcatca tgctcgtcat 3144 0 
gatatcaatg ttggcacaac acaggcacac gtgcatacac ttcctcagga ttacaagctc 31500 
ctcccgcgtt agaaccatat cccagggaac aacccattcc tgaatcagcg taaatcccac 3156 0 
actgcaggga agacctcgca cgtaactcac gttgtgcatt gtcaaagtgt tacattcggg 31620 
cagcagcgga tgatcctcca gtatggtagc gcgggtttct gtctcaaaag gaggtagacg 31680 
atccctactg tacggagtgc gccgagacaa ccgagatcgt gttggtcgta gtgtcatgcc 31740 
aaatggaacg ccggacgtag tcatatttcc tgaagcaaaa ccaggtgcgg gcgtgacaaa 31800 
cagatctgcg tctccggtct cgccgcttag atcgctctgt gtagtagttg tagtatatcc 31860 
actctctcaa agcatccagg cgccccctgg cttcgggttc tatgtaaact ccttcatgcg 31920 
ccgctgccct gataacatcc accaccgcag aataagccac acccagccaa cctacacatt 31980 
cgttctgcga gtcacacacg ggaggagcgg gaagagctgg aagaaccatg tttttttttt 32040 
tattccaaaa gattatccaa aacctcaaaa tgaagatcta ttaagtgaac gcgctcccct 32100 
ccggtggcgt ggtcaaactc tacagccaaa gaacagataa tggcatttgt aagatgttgc 32160 
acaatggctt ccaaaaggca aacggccctc acgtccaagt ggacgtaaag gctaaaccct 32220 
tcagggtgaa tctcctctat aaacattcca gcaccttcaa ccatgcccaa ataattctca 32280 
tctcgccacc ttctcaatat atctctaagc aaatcccgaa tattaagtcc ggccattgta 32340 
aaaatctgct ccagagcgcc ctccaccttc agcctcaagc agcgaatcat gattgcaaaa 32400 
attcaggttc ctcacagacc tgtataagat tcaaaagcgg aacattaaca aaaataccgc 32460 
gatcccgtag gtcccttcgc agggccagct gaacataatc gtgcaggtct gcacggacca 3252 0 
gcgcggccac ttccccgcca ggaaccttga caaaagaacc cacactgatt atgacacgca 325 80 
tactcggagc tatgctaacc agcgtagccc cgatgtaagc tttgttgcat gggcggcgat 32640 
ataaaatgca aggtgctgct caaaaaatca ggcaaagcct cgcgcaaaaa agaaagcaca 327 00 
tcgtagtcat gctcatgcag ataaaggcag gtaagctccg gaaccaccac agaaaaagac 32760 
accatttttc tctcaaacat gtctgcgggt ttctgcataa acacaaaata aaataacaaa 32820 
aaaacattta aacattagaa gcctgtctta caacaggaaa aacaaccctt ataagcataa 3 28 80 
gacggactac ggccatgccg gcgtgaccgt aaaaaaactg gtcaccgtga ttaaaaagca 32940 
ccaccgacag ctcctcggtc atgtccggag tcataatgta agactcggta aacacatcag 330 00 
gttgattcat cggtcagtgc taaaaagcga ccgaaatagc ccgggggaat acatacccgc 33 060 
aggcgtagag acaacattac agcccccata ggaggtataa caaaattaat aggagagaaa 33120 
aacacataaa cacctgaaaa accctcctgc ctaggcaaaa tagcaccctc ccgctccaga 33180 
acaacataca gcgcttcaca gcggcagcct aacagtcagc cttaccagta aaaaagaaaa 33240 
cctattaaaa aaacaccact cgacacggca ccagctcaat cagtcacagt gtaaaaaagg 333 0 0 
gccaagtgca gagcgagtat atataggact aaaaaatgac gtaacggtta aagtccacaa 33360 
aaaacaccca gaaaaccgca cgcgaaccta cgcccagaaa cgaaagccaa aaaacccaca 33420 
acttcctcaa atcgtcactt ccgttttccc acgttacgta acttcccatt ttaagaaaac 33480 
tacaattccc aacacataca agttactccg ccctaaaacc tacgtcaccc gccccgttcc 33540 
cacgccccgc gccacgtcac aaactccacc ccctcattat catattggct tcaatccaaa 33600 
ataaggtata ttattgatga tg 33622 

<210> 45 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 



<400> 45 
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atgaagcgcg 
accggtcctc 
gagagtcccc 
atgcttgcgc 
caaaatgtaa 
atatctgcac 
atggtcgcgg 
aaacttagca 
acatcaggcc 
ctaactactg 
ggaaaactag 
accgtagcaa 
ggagccttgg 
attgattctc 
caactaaatc 
attaactaca 
gttaacctaa 
ggagatgggc 
attggccatg 
cttagttttg 
ttgtggacca 
ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746. 



<210> 46 
<211> 581 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> 5F KOI 



<400> 46 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr 


Phe Asn 


Pro 


Val 


Tyr 


Pro 


1 




5 










10 








15 




Tyr 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val 


Pro Phe 


Leu 


Thr 


Pro 


Pro 




20 








25 








30 






Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser 


Pro Pro 


Gly 


Val 


Leu 


Ser 






35 








40 








45 








Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser 


Asn Gly 


Met 


Leu 


Ala 


Leu 




50 










55 








60 










Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala Gly 


Asn 


Leu 


Thr 


Ser 


65 








70 










75 








80 


Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Lys Lys 


Thr 


Lys 


Ser 


Asn 










85 










90 








95 




He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 


Leu 


Thr 


Val Thr 


Ser 


Glu 


Ala 


Leu 








100 










105 








110 






Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala Gly 


Asn 


Thr 


Leu 


Thr 






115 










120 








125 








Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His 


Asp Ser 


Lys 


Leu 


Ser 


lie 




130 










135 








140 










Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu 


Gly Lys 


Leu 


Ala 


Leu 


Gin 


145 








150 










155 








160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser 


Ser Thr 


Leu 


Thr 


He 


Thr 
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JLDO 










170 










175 




Ala 


Ser 


Pro 


Pro 
180 


Leu 


Thr 


Thr 


Ala 


Thr 
185 


Gly 


Ser 


Leu 


Glv 


He 
190 


Asp 


Leu 


Lys 


Glu 


Pro 


He 


j.yr 


Thr 


Gin 


Asn 


Gly Lys 


Leu 


Glv 


Leu 


Lys 


Tyr 


Glv 




195 










200 










2 05 








Ala 


Pro 


Leu 


His 


Val 


Thr Asp Asp 


Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 




210 










A JLo 










22 0 










Gly 


Pro 


Gly Val 


Thr 


He 


Asn 


Asn 


Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 


225 










230 










9 R 

Zi -3 3 










240 


Gly Ala 


Leu Gly 


Phe 


Asp 


Ser 


uin 


Gly Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 










245 










250 










255 




Gly Gly 


Leu Arg 


He 


Asp 


Ser 




Asn Arg 




Leu 


He 


Leu 


Asp 


Val 








260 










265 










270 






Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 


Gin 


Leu 


Asn 


Leu 


Arg 


Leu Gly 


Gin 




275 








A o U 










285 








Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 


Asn 


Leu 


Asp 


He 


Asn 


Tvr 
y 


Asn 


290 










O Q d 

A y b 










300 










Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


JLI1J-. 


Til Pi 


Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


305 






310 










315 










320 


Val 


Asn 


Leu 


Ser 


Thr 
325 


Ala 




Gly 


'Leu 


Met 
330 


Phe 


As P 


Ala 


Thr 


Ala 
335 


He 


Ala 


He 


Asn 


Ala 


Gly Asp 




Leu 


Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 








340 










345 










350 






Asn 


Thr 


Asn 
355 


Pro 


Leu 


Lys 


Tnr 


"a c n 

Ji o U 


He 


Gly 


His 


Glv 


Leu 
365 


Glu 


Phe 


Asp 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


uy o 


Leu 


Gly 


Thr 


Gly 


Leu 


Ser 


Phe 


Asp 




370 








J5 / 3 










3 80 










Ser 


Thr 


Gly 


Ala 


He 


Thr 


V cl-L 


Pi -tr 

\j-Ly 


Asn 


Lys 


Asn 


Asn 


Asp 


Lys 


Leu 


Thr 


385 








390 










3 95 








Ala 


400 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


m n 


Ala 


Asn 


Cys 


Arg 


Leu 


Asn 


Glu 








405 










410 










415 




Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 




420 










425 










430 






Leu 


Ala 


Thr 
435 


Val 


Ser 


Val 


Leu 


Ala 
440 


Val 


Lys 


Glv 


Ser 


Leu 
445 


Ala 


Pro 


He 


Ser 


Gly 
450 


Thr 


Val 


Gin 


Ser 


Ala 
455 


His 


Leu 


He 


He 


Arg 
460 


Phe 


Asp 


Glu 


Asn 


Gly Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


465 










470 










475 










480 


Arg Asn 


Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 










485 










490 










495 




Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


His 


Gly 


Lys 


Thr 


Ala 








500 








505 










510 






Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 




515 










520 










525 








Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn Gly 


Thr 


Gin 


Glu 


Thr 


Gly 


Asp 




530 










535 










540 










Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser 


Gly 


545 










550 










555 










560 


His 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe 


Ser 








565 










570 










575 




Tyr 


He 


Ala 


Gin 


Glu 

























580 



<210> 47 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 



i 
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<223> 5F K01RGD 



<400> 47 

atgaagcgcg caagaccgtc tgaagatacc 
accggtcctc caactgtgcc ttttcttact- 
gagagtcccc ctggggtact ctctttgcgc 
atgcttgcgc tcaaaatggg caacggcctc 
caaaatgtaa ccactgtgag cccacctctc 
atatctgcac ccctcacagt tacctcagaa 
atggtcgcgg gcaacacact caccatgcaa 
aaacttagca ttgccaccca aggacccctc 
acatcaggcc ccctcaccac caccgatagc 
ctaactactg ccactggtag cttgggcatt 
ggaaaactag gactaaagta cggggctcct 
accgtagcaa ctggtccagg tgtgactatt 
ggagccttgg gttttgattc acaaggcaat 
attgattctc aaaacagacg ccttatactt 
caactaaatc taagactagg acagggccct 
attaactaca acaaaggcct ttacttgttt 
gttaacctaa gcactgccaa ggggttgatg 
ggagatgggc ttgaatttgg ttcacctaat 
attggccatg gcctagaatt tgattcaaac 
cttagttttg acagcacagg tgccattaca 
ttgtggacca caccagctcc atctcctaac 
ctcactttgg tcttaacaaa atgtggcagt 
gttaaaggca gtttggctcc aatatctgga 
tttgacgaaa atggagtgct actaaacaat 
agaaatggag atcttactga aggcacagcc 
ctatcagctt atccaaaatc tcacggtaaa 
tacttaaacg gagacaaaac taaacctgta 
gaaacaggtg atcattgtga ttgtcgtggt 
tctatgtcat tttcatggga ctggtctggc 
tcctcttaca ctttttcata cattgcccaa 



ttcaaccccg tgtatccata tgacacggaa 60 
cctccctttg tatcccccaa tgggtttcaa 120 
ctatccgaac ctctagttac ctccaatggc 180 
tctctggacg aggccggcaa ccttacctcc 240 
aaaaaaacca agtcaaacat aaacctggaa 3 00 
gccctaactg tggctgccgc cgcacctcta 3 60 
tcacaggccc cgctaaccgt gcacgactcc 42 0 
acagtgtcag aaggaaagct agccctgcaa 4 80 
agtaccctta ctatcactgc ctcaccccct 540 
gacttgaaag agcccattta tacacaaaat 600 
ttgcatgtaa cagacgacct aaacactttg 660 
aataatactt ccttgcaaac taaagttact 72 0 
atgcaactta atgtagcagg aggactaagg 7 80 
gatgttagtt atccgtttga tgctcaaaac 84 0 
ctttttataa actcagccca caacttggat 900 
acagcttcaa acaattccaa aaagcttgag 96 0 
tttgacgcta cagccatagc cattaatgca 102 0 
gcaccaaaca caaatcccct caaaacaaaa 1080 
aaggctatgg ttcctaaact aggaactggc 114 0 
gtaggaaaca aaaataatga taagctaact 12 00 
tgtagactaa atgcagagaa agatgctaaa 12 60 
caaatacttg ctacagtttc agttttggct 1320 
acagttcaaa gtgctcatct tattataaga 13 8 0 
tccttcctgg acccagaata ttggaacttt 1440 
tatacaaacg ctgttggatt tatgcctaac 15 00 
actgccaaaa gtaacattgt cagtcaagtt 15 60 
acactaacca ttacactaaa cggtacacag 162 0 
gattgttttt gtacaactcc aagtgcatac 1680 
cacaactaca ttaatgaaat atttgccaca 1740 
gaataa 1776 



<210> 48 
<211> 591 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<22 3> 5F KOIRGD 



<400> 48 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


1 








5 






Tyr Asp 


Thr 


Glu 


Thr 


Gly 


Pro 








20 








Phe 


Val 


Ser 
35 


Pro 


Asn 


Gly 


Phe 


Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 




50 










55 


Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


65 










70 




Gin 


Asn 


Val 


Thr 


Thr 
85 


Val 


Ser 


lie 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Thr 


Val 


Ala 
115 


Ala 


Ala 


Ala 


Pro 


Met 


Gin 
130 


Ser 


Gin 


Ala 


Pro 


Leu 
135 



Glu 


Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




Pro 


Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 




25 










30 






Gin 


Glu 


Ser 


Pro 


Pro 


Gly 


Val 


Leu 


Ser 


40 










45 








Val 


Thr 


Ser 


Asn Gly 


Met 


Leu 


Ala 


Leu 










60 










Leu 


Asp 


Glu 


Ala 


Gly 


Asn 


Leu 


Thr 


Ser 






75 










80 


Pro 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 






90 










95 




Pro 


Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 




105 










110 






Leu 


Met 


Val 


Ala 


Gly 


Asn 


Thr 


Leu 


Thr 


120 










125 








Thr 


Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 



140 
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Ala 


Thr Gin Gly 


Pro 


Leu 


Thr 


Val 


Ser Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 


145 










X 3 \J 








155 










160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 








165 








170 










175 




Ala 


Ser 


Pro 


Pro 
180 


Leu 


Thr 


Thr 


Ala 


Thr Gly 
185 


Ser 


Leu 


Glv 


He 
190 


Asp 


Leu 


Lys 


Glu 


Pro 


lie 


Tyr 


Thr 


Gin 


Asn 


Gly Lys 


Leu 


Gly 


Leu 


Lys 

JL 


Tyr 


Gly 




195 










200 








205 








Ala 


Pro 
210 


Leu 


His 


Val 


Thr 


Asp 
215 


Asp 


Leu Asn 


Thr 


Leu 

220 


Thr 


Val 


Ala 


Thr 


Gly 


Pro 


Gly Val 


Thr 


He 


Asn 


Asn 


Thr Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 


225 










9 "3 n 

/5 J U 








235 










240 


Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin Gly Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 








245 








250 










255 




Gly Gly Leu Arg 


He 


Asp 


Ser 


Gin Asn Arg Arg 


Leu 


He 


Leu 


Asp 


Val 








260 










265 








270 






Ser 


Tyr 


Pro 


Phe Asp 


Ala 


Gin 


Asn 


Gin Leu 


Asn 


Leu 


Arg 


Leu 


Gly 


Gin 




275 










280 








285 








Gly Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 




290 










z> ~> 








300 










Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Ser Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


305 






310 








315 










320 


Val 


Asn 


Leu 


Ser 


Thr 
325 


Ala 


Lys 


Gly 


Leu Met 
330 


Phe 


Asp 


Ala 


Thr 


Ala 
335 


He 


Ala 


lie Asn Ala Gly 


Asp 


Gly 


Leu 


Glu Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 








340 










345 








350 






Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He Gly His 


Gly 


Leu 


Glu 


Phe 


Asp 






355 










360 








365 








Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu Gly Thr 


Gly 


Leu 


Ser 


Phe 


Asp 




370 








375 








380 










Ser Thr Gly Ala 


He 


Thr 


Val 


Gly Asn Lys 


Asn 


Asn Asp 


Lys 


Leu 


Thr 


385 










-3QA 
J J V 








395 










400 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 


Ala Asn 


C Y S 


Arg 


Leu 


Asn 


Ala 


Glu 








405 








410 










415 




Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 




420 










425 








430 






Leu 


Ala 


Thr 
435 


Val 


Ser 


Val 


Leu 


Ala 
440 


Val Lys 


Gly 


Ser 


Leu 
445 


Ala 


Pro 


He 


Ser 


Gly 
450 


Thr 


Val 


Gin 


Ser 


Ala 
455 


His 


Leu He 


He 


Arg 
460 


Phe 


Asp 


Glu 


Asn 


Gly Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


465 










470 








475 










480 


Arg Asn Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 










485 








490 










495 




Phe 


Met 


Pro 


Asn 
500 


Leu 


Ser 


Ala 


Tyr 


Pro Lys 
505 


Ser 


His 


Gly 


Lys 
510 


Thr 


Ala 


Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 




515 










520 








525 








Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn Gly Thr 


Gin 


Glu 


Thr 


Gly 


Asp 




530 










535 








540 










His 


Cys 


Asp 


Cys 


Arg 


Gly Asp 


Cys 


Phe Cys 


Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


545 








550 








555 










560 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser Gly His 


Asn 


Tyr 


He 


Asn 


Glu 










565 






570 










575 




lie 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe Ser 


Tyr 


He 


Ala 


Gin 


Glu 





580 585 590 



<210> 49 
<211> 1746 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> 5F K012 
<400> 49 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 12 0 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 18 0 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 24 0 
caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 3 00 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 42 0 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 48 0 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 60 0 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 72 0 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 78 0 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 840 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 90 0 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 102 0 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 10 80 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 12 0 0 
ttgtggacca caccagctcc atctcctaac tgttcactaa atggaggcgg agatgctaaa 12 60 
ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 1320 
gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 13 80 
tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 144 0 
agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 
ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 
tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 1620 
gaaacaggag acacaactcc aagtgcatac tctatgtcat tttcatggga ctggtctggc 1680 
cacaactaca ttaatgaaat atttgccaca tcctcttaca ctttttcata cattgcccaa 1740 
gaataa 

<210> 50 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K012 
<400> 50 

Met Lys Arg Ala Arg Pro Ser 

1 5 
Tyr Asp Thr Glu Thr Gly Pro 
20 

Phe Val Ser Pro Asn Gly Phe 
35 

Leu Arg Leu Ser Glu Pro Leu 

50 55 
Lys Met Gly Asn Gly Leu Ser 
65 " 70 

Gin Asn Val Thr Thr Val Ser 
85 

lie Asn Leu Glu lie Ser Ala 
100 

Thr Val Ala Ala Ala Ala Pro 



Glu Asp Thr Phe Asn Pro Val Tyr Pro 

10 15 
Pro Thr Val Pro Phe Leu Thr Pro Pro 

25 30 
Gin Glu Ser Pro Pro Gly Val Leu Ser 
40 45 
Val Thr Ser Asn Gly Met Leu Ala Leu 
60 

Leu Asp Glu Ala Gly Asn Leu Thr Ser 

75 80 
Pro Pro Leu Lys Lys Thr Lys Ser Asn 

90 95 
Pro Leu Thr Val Thr Ser Glu Ala Leu 

105 HO 
Leu Met Val Ala Gly Asn Thr Leu Thr 
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115 

Met Gin Ser Gin 
130 

Ala Thr Gin Gly 
145 

Thr Ser Gly Pro 

Ala Ser Pro Pro 
180 

Lys Glu Pro lie 
195 

Ala Pro Leu His 
210 

Gly Pro Gly Val 
225 

Gly Ala Leu Gly 

Gly Gly Leu Arg 
260 

Ser Tyr Pro Phe 
2 75 

Gly Pro Leu Phe 
290 

Lys Gly Leu Tyr 
305 

Val Asn Leu Ser 

Ala He Asn Ala 
340 

Asn Thr Asn Pro 
355 

Ser Asn Lys Ala 
370 

Ser Thr Gly Ala 
385 

Leu Trp Thr Thr 

Gly Asp Ala Lys 
420 

Leu Ala Thr Val 
435 

Ser Gly Thr Val 
450 

Gly Val Leu Leu 
465 

Arg Asn Gly Asp 

Phe Met Pro Asn 
500 

Lys Ser Asn He 
515 

Pro Val Thr Leu 
530 

Thr Thr Pro Ser 
545 

His Asn Tyr He 

Tyr lie Ala Gin 
580 



120 

Ala Pro Leu Thr 
135 

Pro Leu Thr Val 
150 

Leu Thr Thr Thr 
165 

Leu Thr Thr Ala 

Tyr Thr Gin Asn 
200 

Val Thr Asp Asp 
215 

Thr He Asn Asn 
230 

Phe Asp Ser Gin 
245 

He Asp Ser Gin 

Asp Ala Gin Asn 
280 

He Asn Ser Ala 
295 

Leu Phe Thr Ala 
310 

Thr Ala Lys Gly 
325 

Gly Asp Gly Leu 

Leu Lys Thr Lys 
360 

Met Val Pro Lys 
375 

He Thr Val Gly 
390 

Pro Ala Pro Ser 
405 

Leu Thr Leu Val 

Ser Val Leu Ala 
440 

Gin Ser Ala His 
455 

Asn Asn Ser Phe 
470 

Leu Thr Glu Gly 
485 

Leu Ser Ala Tyr 

Val Ser Gin Val 
520 

Thr He Thr Leu 
535 

Ala Tyr Ser Met 
550 

Asn Glu He Phe 
565 
Glu 



Val His Asp Ser 
140 

Ser Glu Gly Lys 
155 

Asp Ser Ser Thr 
170 

Thr Gly Ser Leu 
185 

Gly Lys Leu Gly 

Leu Asn Thr Leu 
220 

Thr Ser Leu Gin 
235 

Gly Asn Met Gin 
250 

Asn Arg Arg Leu 
265 

Gin Leu Asn Leu 

His Asn Leu Asp 
300 

Ser Asn Asn Ser 
315 

Leu Met Phe Asp 
330 

Glu Phe Gly Ser 
345 

He Gly His" Gly 

Leu Gly Thr Gly 
380 

Asn Lys Asn Asn 
395 

Pro Asn Cys Ser 
410 

Leu Thr Lys Cys 
425 

Val Lys Gly Ser 

Leu He He Arg 
460 

Leu Asp Pro Glu 
475 

Thr Ala Tyr Thr 
490 

Pro Lys Ser His 
5 05 

Tyr Leu Asn Gly 

Asn Gly Thr Gin 
540 

Ser Phe Ser Trp 
555 

Ala Thr Ser Ser 
570 



125 

Lys Leu Ser He 

Leu Ala Leu Gin 
160 

Leu Thr He Thr 
175 

Gly He Asp Leu 
19 0 

Leu Lys Tyr Gly 
205 

Thr Val Ala Thr 

Thr Lys Val Thr 
240 

Leu Asn Val Ala 
255 

He Leu Asp Val 
270 

Arg Leu Gly Gin 
285 

He Asn Tyr Asn 

Lys Lys Leu Glu 
320 

Ala Thr Ala He 
335 

Pro Asn Ala Pro 
350 

Leu Glu Phe Asp 
365 

Leu Ser Phe Asp 

Asp Lys Leu Thr 
400 

Leu Asn Gly Gly 
415 

Gly Ser Gin He 
43 0 

Leu Ala Pro He 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Asp Trp Ser Gly 
560 

Tyr Thr Phe Ser 
5 75 
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<210> 51 
<211> 1746 - 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> 5F S* 



<400> 51 

atgaagcgcg caagaccgtc tgaagatacc 
accggtcctc caactgtgcc ttttcttact 
gagagtcccc ctggggtact ctctttgcgc 
atgcttgcgc tcaaaatggg caacggcctc 
caaaatgtaa ccactgtgag cccacctctc 
atatctgcac ccctcacagt tacctcagaa 
atggtcgcgg gcaacacact caccatgcaa 
aaacttagca ttgccaccca aggacccctc 
acatcaggcc ccctcaccac caccgatagc 
ctaactactg ccactggtag cttgggcatt 
ggaaaactag gactaaagta cggggctcct 
accgtagcaa ctggtccagg tgtgactatt 
ggagccttgg gttttgattc acaaggcaat 
attgattctc aaaacagacg ccttatactt 
caactaaatc taagactagg acagggccct 
attaactaca acaaaggcct ttacttgttt 
gttaacctaa gcactgccaa ggggttgatg 
ggagatgggc ttgaatttgg ttcacctaat 
attggccatg gcctagaatt tgattcaaac 
cttagttttg acagcacagg tgccattaca 
ttgtggacca caccagctcc atctcctaac 
ctcactttgg tcttaacaaa atgtggcagt 
gttaaaggca gtttggctcc aatatctgga 
tttgacgaaa atggagtgct actaaacaat 
agaaatggag atcttactga aggcacagcc 
ctatcagctt atccaaaatc tcacggtaaa 
tacttaaacg gagacaaaac taaacctgta 
gaaacaggag acacaactcc aagtgcatac 
cacaactaca ttaatgaaat atttgccaca 
gaataa 

<210> 52 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S* 



ttcaaccccg tgtatccata tgacacggaa 60 
cctccctttg tatcccccaa tgggtttcaa 120 
ctatccgaac ctctagttac ctccaatggc 180 
tctctggacg aggccggcaa ccttacctcc 240 
ggagccggag cctcaaacat aaacctggaa 300 
gccctaactg tggctgccgc cgcacctcta 3 60 
tcacaggccc cgctaaccgt gcacgactcc 42 0 
acagtgtcag aaggaaagct agccctgcaa 48 0 
agtaccctta ctatcactgc ctcaccccct 540 
gacttgaaag agcccattta tacacaaaat 60 0 
ttgcatgtaa cagacgacct aaacactttg 660 
aataatactt ccttgcaaac taaagttact 72 0 
atgcaactta atgtagcagg aggactaagg 7 80 
gatgttagtt atccgtttga tgctcaaaac 84 0 
ctttttataa actcagccca caacttggat 90 0 
acagcttcaa acaattccaa aaagcttgag 96 0 
tttgacgcta cagccatagc cattaatgca 102 0 
gcaccaaaca caaatcccct caaaacaaaa 10 80 
aaggctatgg ttcctaaact aggaactggc 1140 
gtaggaaaca aaaataatga taagctaact 12 00 
tgtagactaa atgcagagaa agatgctaaa 12 60 
caaatacttg ctacagtttc agttttggct 1320 
acagttcaaa gtgctcatct tattataaga 13 80 
tccttcctgg acccagaata ttggaacttt 1440 
tatacaaacg ctgttggatt tatgcctaac 15 00 
actgccaaaa gtaacattgt cagtcaagtt 15 60 
acactaacca ttacactaaa cggtacacag 162 0 
tctatgtcat tttcatggga ctggtctggc 1680 
tcctcttaca ctttttcata cattgcccaa 1740 

1746 



<400> 52 

Met Lys Arq Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

20 25 30 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 * 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 
85 90 95 



WO 2003/062400 



PCT/US2003/002295 



-41- 



T 1 o 

lie 


7\ on 


Leu 


Glu 


He 


Ser 


Ala 


Pro 


Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 








inn 

JL \J \J 










105 










110 






Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly 


Asn 


Thr 


Leu 


Thr 






115 










120 










125 








Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 




130 










135 










140 










Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 


145 








150 










155 










160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 


















170 










175 




Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 


Thr 


Gly 


Ser 


Leu 


Gly 


He 


Asp 


Leu 








180 










185 










190 






L Y S 


Glu 


Pro 


He 


Tvr 
xy j_ 


Thr 


Gin 


Asn Gly 


Lys 


Leu 


Gly 


Leu 


Lys 


Tyr 


Gly 




195 










200 










205 








Ala 


Pro 


Leu 


His 


Val 


Thr Asp 


Asp 


Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 




210 










/ID 










220 










Gly Pro 


Gly Val 


Thr 


He 


Asn 


Asn 


Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 


225 










230 










235 










240 


Gly Ala Leu Gly 


Phe 


Asp 


Ser 


Gin Gly Asn Met 


Gin 


Leu 


Asn 


Val 


Ala 










245 










250 










255 




Gly Gly Leu Arg 


He 


Asp 


Ser 


Gin Asn Arg Arg 


Leu 


He 


Leu 


Asp 


Val 








260 










265 










270 






Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


\jJ-Ll. 


Asn 


Gin 


Leu 


Asn 


Leu Arg Leu Gly 


Gin 




275 








280 










285 








Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 


290 




















300 










Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


±L1JL 


Ala 


Ser 


Asn 


Asn 


Ser 


l y s 


Lys 


Leu 


Glu 


305 






310 










315 










320 


Val 


Asn 


Leu- 


Ser 


Thr 


Ala 


Lys 


Gly Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 










325 








330 










335 




Ala 


He 


Asn 


Ala 


Gly Asp 


o-jLy 


Leu 


Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 








340 










345 










350 






Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He Gly His 


Gly 


Leu 


Glu 


Phe 


Asp 






355 










360 










365 








Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu Gly Thr 


Gly Leu 


Ser 


Phe 


Asp 




370 








375 










380 










Ser Thr Gly Ala 


He 


Thr 


Val 


Gly Asn 


Lys 


Asn 


Asn Asp 


Lys 


Leu 


Thr 


385 










390 










395 










400 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Pro 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 








405 










410 










415 




Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 




420 










425 










430 






Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 






435 










440 










445 








Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 




450 










455 










460 










Gly Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


465 










470 










475 










480 


Arg 


Asn 


Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 








485 










490 










495 




Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


His 


Gly 


Lys 


Thr 


Ala 








500 










505 










510 






Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 




515 










520 










525 








Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu Asn Gly 


Thr 


Gin 


Glu 


Thr 


Gly 


Asp 




530 










535 










540 










Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser 


Gly 


545 










550 










555 










560 


His 


Asn ,Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe 


Ser 










565 










570 










575 




Tyr 


He 


Ala 


Gin 


Glu 
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580 



<210> 53 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 
<400> 53 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 12 0 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc ggagccggag cctcaaacat aaacctggaa 30 0 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 420 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 480 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 54 0 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 60 0 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 72 0 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 7 80 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 84 0 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 90 0 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 12 0 0 
ttgtggacca caccagctcc atctcctaac tgtagactaa atgcagagaa agatgctaaa 12 60 
ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 132 0 
gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 13 8 0 
tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 144 0 
agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 15 0 0 
ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 
tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 162 0 
gaaacaggtg atcattgtga ttgtcgtggt gattgttttt gtacaactcc aagtgcatac 1680 
tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 1740 
tcctcttaca ctttttcata cattgcccaa gaataa 1776 

<210> 54 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 
<400> 54 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

2 0 25 3 0 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
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65 

Gin Asn Val Thr 

lie Asn Leu Gin 
100 

Thr Val Ala Ala 
115 

Met Gin Ser Gin 
130 

Ala Thr Gin Gly 
145 

Thr Ser Gly Pro 

Ala Ser Pro Pro 
180 

Lys Glu Pro lie 
195 

Ala Pro Leu His 
210 

Gly Pro Gly Val 
225 

Gly Ala Leu Gly 

Gly Gly Leu Arg 
260 

Ser Tyr Pro Phe 
275 

Gly Pro Leu Phe 
290 

Lys Gly Leu Tyr 
3 05 

Val Asn Leu Ser 

Ala lie Asn Ala 
340 

Asn Thr Asn Pro 
355 

Ser Asn Lys Ala 
370 

Ser Thr Gly Ala 
385 

Leu Trp Thr Thr 

Lys Asp Ala Lys 
420 

Leu Ala Thr Val 
435 

Ser Gly Thr Val 
450 

Gly Val Leu Leu 
465 

Arg Asn Gly Asp 

Phe Met Pro Asn 
500 

Lys Ser Asn lie 
515 

Pro Val Thr Leu 
530 

His Cys Asp Cys 
545 





70 






Thr 


vax 


Ser 


Pro 


O IT 








i le 


Ser 


Ala 


Pro 


Ala. 


Aia 


Pro 


Leu 








120 


Aia. 


Pro 


Leu 


Thr 






135 




Pro 


Leu 


Thr 


Val 




150 






Leu 


Thr 


Thr 


Thr 


lob 








Leu 


Tnr 


Thr 


Ala 


Tyr 


1 xir 


Gin 


Asn 








200 


Val 


Thr 


Asp Asp 






_L 3 




Thr 


He 


Asn 


Asn 




230 






Phe 


Asp 


Ser 


Gin 


j£ *± D 








lie 


Asp 


Ser 


Gin 




■c-ild 


Gin 


Asn 








280 


lie 


Asn 


Ser 


Ala 






295 




Leu 


Phe 


Thr 


Ala 




310 






Thr 


7\ "1 o 


Lys 


Gly 


-5 Zz> 










7\ ay~\ 


Gly 


Leu 


Leu 


Lys 


Thr 


Lys 








360 




V Ct _L 


Pro 


Lys 






375 




lie 


Thr 


Val 


Gly 




390 






Pro 


Hid. 


Pro 


Ser 


A n c: 








Leu 


Tnr 


Leu 


Val 


C? <=> -y~ 
O C .L. 


Val 


Leu 


Ala 








440 


Gin 


Ser 


Ala 


His 






455 




Asn 


Asn 


Ser 


Phe 




470 






Leu 


Thr 


Glu Gly 


485 








Leu 


Ser 


Ala 


Tyr 


Val 


Ser 


Gin 


Val 








520 


Thr 


He 


Thr 


Leu 






535 




Arg 


Gly 


Asp 


Cys 




550 







75 

Pro Leu Gly Ala 
90 

Leu Thr Val Thr 
105 

Met Val Ala Gly 

Val His Asp Ser 
140 

Ser Glu Gly Lys 
155 

Asp Ser Ser Thr 
170 

Thr Gly Ser Leu 
185 

Gly Lys Leu Gly 

Leu Asn Thr Leu 
220 

Thr Ser Leu Gin 
235 

Gly Asn Met Gin 
250 

Asn Arg Arg Leu 
265 

Gin Leu Asn Leu 

His Asn Leu Asp 
300 

Ser Asn Asn Ser 
315 

Leu Met Phe Asp 
330 

Glu Phe Gly Ser 
345 

lie Gly His Gly 

Leu Gly Thr Gly 
380 

Asn Lys Asn Asn 
395 

Pro Asn Cys Arg 
410 

Leu Thr Lys Cys 
425 

Val Lys Gly Ser 

Leu lie He Arg 
460 

Leu Asp Pro Glu 
475 

Thr Ala Tyr Thr 
490 

Pro Lys Ser His 
505 

Tyr Leu Asn Gly 

Asn Gly Thr Gin 
540 

Phe Cys Thr Thr 
555 



80 

Gly Ala Ser Asn 
95 

Ser Glu Ala Leu 
110 

Asn Thr Leu Thr 
125 

Lys Leu Ser He 

Leu Ala Leu Gin 
160 

Leu Thr He Thr 
175 

Gly He Asp Leu 
190 

Leu Lys Tyr Gly 
205 

Thr Val Ala Thr 

Thr Lys Val Thr 
240 

Leu Asn Val Ala 
255 

He Leu Asp Val 
270 

Arg Leu Gly Gin 
285 

He Asn Tyr Asn 

Lys Lys Leu Glu 
32 0 

Ala Thr Ala He 
335 

Pro Asn Ala Pro 
350 

Leu Glu Phe Asp 
365 

Leu Ser Phe Asp 

Asp Lys Leu Thr 
400 

Leu Asn Ala Glu 
415 

Gly Ser Gin He 
430 

Leu Ala Pro He 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Pro Ser Ala Tyr 
560 
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Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr lie Asn Glu 

565 ^ 570 575 

lie Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr He Ala Gin Glu 

580 585 590 



<210> 55 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 



<400> 55 
atgaagcgcg 
accggtcctc 
gagagtcccc 
atgcttgcgc 
caaaatgtaa 
atatctgcac 
atggtcgcgg 
aaacttagca 
acatcaggcc 
ctaactactg 
1 ggaaaactag 
accgtagcaa 
ggagccttgg 
attgattctc 
caactaaatc 
attaactaca 
gttaacctaa 
ggagatgggc 
attggccatg 
cttagttttg 
ttgtggacca 
ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 

ggggttgatg 

ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
ggagccggag 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
cctcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<210> 56 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 



<400> 56 

Met Lys Arg Ala- Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

2 0 25 3 0 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 
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Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 

85 90 95 

lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 110 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 ^ 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

180 185 190 

Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 " 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 ~ 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 ~ 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
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530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 
580 



<210> 57 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KO!S*RGD 



<400> 57 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggtg 

tctatgtcat 

tcctcttaca 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
atcattgtga 
tttcatggga 
ctttttcata 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
ttgtcgtggt 
ctggtctggc 
cattgcccaa 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
ggagccggag 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
gattgttttt 
cacaactaca 
gaataa 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
cctcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
gtacaactcc 
ttaatgaaat 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
aagtgcatac 
atttgccaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1776 



<210> 58 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KO!S*RGD 



<400> 58 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
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20 



irlie 


TT 2 "I 




Pro 


Asn Gly 


Phe 


Gin 






35 












Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


J_lt. u. 


V d JL 




ZD u 










ZD ZD 




T 

Lys 


M«=»-t- 
i v ie u 


uiy 


Asn Gly 


Leu 


Ser 


Leu 


O ZD 










/ u 






bin 


Asn 


v ax 


Thr 


Thr 


Vdl 


DCl 


Jtr JL LJ 










ob 








x xe 


Asn 


Leu 




lie 


OCI 




Pro 








10 u 










Tnr 


V dX 


7\ I 

Hid 


Aia 


7\ 1 

Hid 


7\ I => 


Jr JL W 


Leu 






115 










120 




nil -n 


O t2 


(j-Ln 


7\ 1 

Ala 


Pro 


Leu 


Thr 




13 0 










135 




■ri—L Ct 


ill! 


m n 

UJ.I1 


uiy 


Prn 


T.#=»i i 

UC LI 


Thr 


Val 


145 










IjU 






X 11 JL 


o tr J_ 


w _l _y 


Pro 


T 

Leu 


JL 11JL 


Thr 


Thr 








lbb 










OCi 


Jr JL 


Pro 


Leu 


Thr 


Thr 


Ala 








loU 










Lys 


Glu 


Pro 


lie 


Tyr 


Thr 


Gin 


Asn 






195 










200 


7\ 1 =3 


Jr J- 


Leu 


JtllS 


Val 


Thr 


Asp 


Asp 




210 










215 




Gly 


XT -L» 


Gly 


vdl 


Thr 


He 


Asn 


Asn 


22 5 










230 






Gly 


Ala 


Leu 


uiy 


Phe 


Asp 


Ser 


Gin 










245 








Gly 




Jut: Li 


niy 


He 


Asp 


Ser 


Gin 








o ez n 










faer 


±yr 


ri f 


jyne 


Asp 


Ala 


Gin 


Asn 






275 










280 




Piro 


Leu 


jyne 


He 


Asn 


Ser 


Ala 


290 










295 






Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


-j \j -j 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 










325 








Ala 


He 


Asn 


Ala 


Gly Asp 


Gly 


Leu 








340 










JAtoll 


Tin 

-L 111. 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 






355 










360 


Del 


jritoll 




Ala 


Met 


Val 


Pro 


Lys 




370 








375 




Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly 


385 










390 






Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 








405 








Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 








420 










Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 






435 










440 


Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 




450 










455 




Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


465 










470 






Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly 










485 








Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 



500 



25 30 



Glu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 










45 








Thr 


Ser 


Asn 


Gly 


Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 






75 










80 


Prn 
Jr _L <w> 


Leu 


Gly 


Ala 


Gly Ala 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










X J- \J 






Met 


Val 


Ala 


Gly Asn 


X ll-L 


Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu Gly 


Lys 


Leu 


Hid 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 




170 










175 




Thr 


Gly Ser 


Leu Gly 


He 


Asp 


Leu 


185 










190 






Gly 


Lys 


Leu 


Gly Leu 


Lys 


Tvr 
j. j j- 


Glv 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 










240 


Gly 


Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 


250 










255 




Asn 


Arg 


Arg 


Leu 


He 


Leu 


Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu 


Arg 


Leu 


Gly 


Gin 










285 








His 


Asn 


Leu 


Asp 


He 


Asn 


Tvr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 










335 




Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


345 










350 






He 


Gly His 


Gly Leu Glu 


Phe 


Asp 










365 








Leu 


Gly Thr 


Gly Leu 


Ser 


Phe 


Asp 








380 










Asn 


Lys 


Asn 


Asn Asp 


Lys 


Leu 


Thr 






395 










400 


Ala 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 




410 










415 




Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 


425 










430 






Val 


Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 










445 








Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 








460 










Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 






475 










480 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 




490 










495 




Pro 


Lys 


Ser 


His 


Gly 


Lys 


Thr 


Ala 


505 










510 
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Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 




515 










520 


Pro 


Val 
530 


Thr 


Leu 


Thr 


He 


Thr 
535 


Leu 


His 


Cys 


Asp 


Cys 


Arg 


Gly Asp 


Cys 


545 










550 






Ser 


Met 


Ser 


Phe 


Ser 
565 


Trp 


Asp 


Trp 


He 


Phe 


Ala 


Thr 
580 


Ser 


Ser 


Tyr 


Thr 



Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 










525 








Asn 


Gly 


Thr 


Gin 


Glu 


Thr 


Gly Asp 








540 










Phe 


Cys 


Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 






555 










560 


Ser 


Gly 


His 


Asn 


Tyr 


He 


Asn 


Glu 




570 










575 




Phe 


Ser 


Tyr 


He 


Ala 


Gin 


Glu 




585 








590 







<210> 59 
<211> 972 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 35F 



<400> 59 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacatacgtg 

aatggattag 

aacaacggtg 

ccacctaact 

ttagtaaaaa 

gtgaaccaaa 

tctggaaatc 

acagcgacca 

cccttcaaca 

actagttatg 

atttcttcca 

ccagaaagca 

gacgacgaat 



gagtccggct 
aacacccctt 
gagttcttac 
aagtgggagg 
ctacagcacc 
aaactcaaaa 
acatttgtat 
gtcaaattgt 
atggagggct 
tgttcacaca 
tattaactga 
gtgaaactgt 
ccactactag 
atagaagtct 
atgttgccta 
acatagctac 
aa 



cagtgactcc 
tataaaccca 
tttaaaatgt 
gggacttaca 
cattactaaa 
caataaacta 
aaaggatagt 
ggaaaacact 
tgttaatggc 
aaagacagca 
ggaatcagac 
agccagcagc 
ggatagtgaa 
atttcccttg 
tgccatacaa 
gctgaccaca 



ttcaaccctg 
gggtttattt 
ttaaccccac 
gtggatgaca 
aataatcact 
tgtgccaaat 
attaacacct 
aatacaaatg 
tacgtgtctc 
aacatccaat 
ttaaaaattc 
aaagccttta 
aactacattc 
aacatttcta 
tttgaatgga 
tccccctttt 



tctaccccta 
ccccaaatgg 
taacaaccac 
ctgatggtac 
ctgtagaact 
tgggaaatgg 
tatggactgg 
atggcaaact 
tagttggtgt 
taagattata 
cacttaaaaa 
tgccaagtac 
atggaatatg 
taatgctaaa 
atctaaatgc 
tcttttctta 



tgaagatgaa 60 
cttcacacaa 120 
aggcggatct 180 
cttacaagaa 240 
atccattgga 300 
gttaaaattt 360 
aataaaccct 42 0 
tactttagta 480 
atcagacact 540 
ttttgactct 600 
taaatcttct 660 
tacagcttat 720 
ttactacatg 780 
cagccgtatg 840 
aagtgaatct 90 0 
cattacagaa 960 
972 



<210> 60 
<211> 323 
<212> PRT 

<213> Artificial Sequence 



<220> 
<223> 35F 



<400> 60 



Met 


Thr 


Lys 


Arg 


Val 


Arg 


Leu 


Ser 


Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 






5 










10 










15 




Tyr 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 








20 










25 










30 






He 


Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 


Ser 


Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 






35 










40 










45 








Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr 


Gly 


Gly 


Ser 


Leu 


Gin 


Leu 


Lys 




50 










55 










60 










Val 


Gly Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp 


Thr 


Asp 


Gly 


Thr 


Leu 


Gin 


Glu 


65 










70 










75 










80 


Asn 


He 


Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 








85 










90 










95 




Leu 


Ser 


He 


Gly 


Asn 


Gly 


Leu 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 
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t n n 

1UU 




Leu 


r*l it 
vj-Ly 












Asp 


Ser 


He 


Asn 




130 






Gin 


He 


V d-L 


ri~\ ii 
vj_i_ u. 


145 








Leu 


Val 


Lys 


Asn 


Val 


Ser 


Asp 


Tnr 










Gin 


Leu 


Arg 


Leu 










Ser Asp 


Leu 


Lys 




210 






Glu 


Thr 


Val 


Ala 


225 








Pro 


Phe 


Asn 


Thr 


Cys 


Tyr 


Tyr 


Met 








260 


Ser 


He 


Met 


Leu 






275 




lie 


Gin 


Phe 


Glu 




290 






lie 


Ala 


Thr 


Leu 


305 








Asp 


Asp 


Glu 







Leu 


Lys 


T)"h ^ 

ir I ltr 








120 


Tnr 


Leu 


Trp 


Thr 






135 




Asn 


Thr 


Asn 


Thr 




i en 






r*~\ it 


\j-Ly 


Leu 


Val 


Xo o 








V cl J. 


Asn 


Gin 


Met 


Tyr 




Asp 


Ser 








200 


He 


Pro 


Leu 


Lys 






215 




Ser 


Ser 


Lys 


Ala 




230 






Thr 


Thr 


Arg Asp 


245 








Thr 


Ser 


Tyr 


Asp 


Asn 


Ser 


Arg 


Met 








280 


Trp 


Asn 


Leu 


Asn 






295 




Thr 


Thr 


Ser 


Pro 




310 







105 

Asn Asn Gly Asp 

Gly He Asn Pro 
140 

Asn Asp Gly Lys 
155 

Asn Gly Tyr Val 
170 

Phe Thr Gin Lys 
185 

Ser Gly Asn Leu 

Asn Lys Ser Ser 
220 

Phe Met Pro Ser 
235 

Ser Glu Asn Tyr 
250 

Arg Ser Leu Phe 
265 

He Ser Ser Asn 

Ala Ser Glu Ser 
300 

Phe Phe Phe Ser 
315 



110 

He Cys He Lys 
125 

Pro Pro Asn Cys 

Leu Thr Leu Val 
160 

Ser Leu Val Gly 
175 

Thr Ala Asn He 
190 

Leu Thr Glu Glu 
205 

Thr Ala Thr Ser 

Thr Thr Ala Tyr 
240 

He His Gly He 
255 

Pro Leu Asn He 
270 

Val Ala Tyr Ala 
285 

Pro Glu Ser Asn 

Tyr He Thr Glu 
32 0 



<210> 61 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 35F RGD 



<400> 61 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacataegtg 

aatggattag 

aacaaeggtg 

ccacctaact 

ttagtaaaaa 

gtgaaccaaa 

tctggaaatc 

acagcgacca 

cccttcaaca 

actagttatg 

atttcttcca 

tttgaatgga 

tccccctttt 



gagtcegget 
aacacccctt 
gagttcttac 
aagtgggagg 
ctacagcacc 
aaactcaaaa 
acatttgtat 
gtcaaattgt 
atggagggct 
tgttcacaca 
tattaactga 
gtgaaactgt 
ccactactag 
atagaagtct 
atgtacattg 
atetaaatge 
tcttttctta 



cagtgactcc 
tataaaccca 
tttaaaatgt 
gggacttaca 
cattactaaa 
caataaacta 
aaaggatagt 
ggaaaacact 
tgttaatggc 
aaagacagca 
ggaatcagac 
agccagcagc 
ggatagtgaa 
atttcccttg 
tgattgtcgt 
aagtgaatct 
cattacagaa 



ttcaaccctg 
gggtttattt 
ttaaccccac 
gtggatgaca 
aataatcact 
tgtgccaaat 
attaacacct 
aatacaaatg 
tacgtgtctc 
aacatccaat 
ttaaaaattc 
aaagecttta 
aactacattc 
aacatttcta 
ggtgattgtt 
ccagaaagca 
gacgacgaat 



tctaccccta 
ccccaaatgg 
taacaaccac 
ctgatggtac 
ctgtagaact 
tgggaaatgg 
tatggactgg 
atggcaaact 
tagttggtgt 
taagattata 
cacttaaaaa 
tgecaagtae 
atggaatatg 
taatgctaaa 
tttgegcata 
acatagctac 
aa 



tgaagatgaa 
cttcacacaa 
aggeggatet 
cttacaagaa 
atccattgga 
gttaaaattt 
aataaaccct 
tactttagta 
atcagacact 
ttttgactct 
taaatcttct 
tacagcttat 
ttactacatg 
cagcegtatg 
tgccatacaa 
gctgaccaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1002 



<210> 62 
<211> 333 
<212> PRT 
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<213> Artificial Sequence 



<220> 

<223> 35F RGD 



<400> 62 



Met 


Thr 


Lys 


Arg 


Val 
5 


Arg 


Leu 


Ser 


1 
Tyr 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 






20 










He 


Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 






35 










4 0 


Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


50 














Val Gly Gly 


Gly 


Leu 


Thr 


Val 


Asp 


65 










70 






Asn 


He 


Arg 


Ala 


Thr 


Ala 


Pro 


He 








85 








Leu 


Ser 


He 


Gly Asn 


Gly 


Leu 


Glu 








100 










Lys 


Leu 


Gly 


Asn Gly 


Leu 


Lys 


Phe 




115 










120 


Asp 


Ser 


He 


Asn 


Thr 


Leu 


Tro 


Thr 


130 










135 




Gin 


He 


Val 


Glu 


Asn 


Thr 


Asn 


Thr 


145 










15 0 






Leu 


Val 


Lys 


Asn Gly 


Gly 


Leu 


Val 










165 








Val 


Ser Asp 


Thr 


Val 


Asn 


Gin 


Met 








180 










Gin 


Leu 


Arg 


Leu 


Tyr 


Phe 


Asp 


Ser 






195 










200 


Ser Asp 


Leu 


Lys 


He 


Pro 


Leu 


Lys 




210 










215 




Glu 


Thr 


Val 


Ala 


Ser 


Ser 


Lys 


Ala 


225 










230 






Pro 


Phe 


Asn 


Thr 


Thr 


Thr 


Arg 


Asp 










245 








Cys 


Tyr 


Tyr 


Met 


Thr 


Ser 


Tyr 


Asp 








260 










Ser 


He 


Met 


Leu 


Asn 


Ser 


Arg 


Met 






275 










280 


Cys 


Arg 


Gly 


Asp 


Cys 


Phe 


Cys 


Ala 




290 










295 




Leu 


Asn 


Ala 


Ser 


Glu 


Ser 


Pro 


Glu 


305 










310 






Ser 


Pro 


Phe 


Phe 


Phe 


Ser 


Tyr 


He 










325 









Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


10 










15 




His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 


25 










30 






Ser 


Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 










45 








Thr 


Gly 


Gly 


Ser 


Leu 


Gin 


Leu 


Lys 








60 










Asp 


Thr 


Asp 


Gly 


Thr 


Leu 


Gin 


Glu 






75 










80 


Thr 


Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 




90 










95 




Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 


105 










110 






Asn Asn Gly Asp 


He 


Cys 


He 


Lys 










125 








Gly 


He 


Asn 


Pro 


Pro 


Pro 


Asn 


Cys 






140 










Asn Asp 


Gly 


Lys 


Leu 


Thr 


Leu 


Val 






155 










160 


Asn Gly Tyr Val 


Ser 


Leu 


Val 


Gly 




170 










175 




Phe 


Thr 


Gin 


Lys 


Thr 


Ala 


Asn 


He 


185 










190 






Ser Gly Asn 


Leu 


Leu 


Thr 


Glu 


Glu 










205 








Asn 


Lys 


Ser 


Ser 


Thr 


Ala 


Thr 


Ser 






220 










Phe 


Met 


Pro 


Ser 


Thr 


Thr 


Ala 


Tyr 






235 










240 


Ser 


Glu 


Asn 


Tyr 


He 


His 


Gly 


He 




250 








255 




Arg 


Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 


265 










270 






He 


Ser 


Ser 


Asn 


Val 


His 


Cys 


Asp 










285 








Tyr 


Ala 


He 


Gin 


Phe 


Glu 


Trp 


Asn 








300 










Ser 


Asn 


He 


Ala 


Thr 


Leu 


Thr 


Thr 






315 










320 


Thr 


Glu Asp 


Asp 


Glu 









330 



<210> 63 
<211> 1164 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF 
<400> 63 

atgaaaagaa ccagaattga agacgacttc aaccccgtct acccctatga caccttctca 60 
actcccagca tcccctatgt agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 12 0 
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cccccaggag ttttagcact caagtacact gaccccatta ctaccaatgc taagcatgag 180 
cttactttaa aacttggaag caacataact ttagaaaatg ggttactttc ggccacagtt 240 
cccactgttt ctcctcccct tacaaacagt aacaactccc tgggtttagc cacatccgct 300 
cccatagctg tatcagctaa ctctctcaca ttggccaccg ccgcaccact gacagtaagc 360 
aacaaccagc ttagtattaa cgcgggcaga ggtttagtta taactaacaa tgccttaaca 42 0 
gttaatccta ccggagcgct aggtttcaat aacacaggag ctttacaatt aaatgctgca 480 
ggaggaatga gagtggacgg tgccaactta attcttcatg tagcatatcc ctttgaagca 540 
atcaaccagc taacactgcg attagaaaac gggttagaag taaccagcgg aggaaagctt 600 
aacgttaagt tgggatcagg cctccaattt gacagtaacg gacgcattgc tattagtaat 660 
agcaaccgaa ctcgaagtgt accatccctc actaccattt ggtctatctc gcctacgcct 720 
aactgctcca tttatgaaac ccaagatgca aacctatttc tttgtctaac taaaaacgga 780 
gctcacgtat taggtactat aacaatcaaa ggtcttaaag gagcactgcg ggaaatgcac 84 0 
gataacgctc tatctttaaa acttcccttt gacaatcagg gaaatttact taactgtgcc 90 0 
ttggaatcat ccacctggcg ttaccaggaa accaacgcag tggcctctaa tgccttaaca 960 
tttatgccca acagtacagt gtatccacga aacaaaaccg ctcacccggg caacatgctc 102 0 
atccaaatct cgcctaacat caccttcagt gtcgtctaca acgagataaa cagtgggtat 10 8 0 
gcttttactt ttaaatggtc agccgaaccg ggaaaacctt ttcacccacc taccgctgta 1140 
ttttgctaca taactgaaga ataa 1164 

<210> 64 

<211> 387 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 41sF 



<400> 64 



Met 


Lys 


Arg 


Thr Arg 


He 


Glu Asp Asp 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


Tyr 


1 




5 










10 










15 




Asp 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser 


He 


Pro 


Tyr 


Val 


Ala 


Pro 


Pro 


Phe 


Val 






20 










25 










30 






Ser 


Ser Asp 


Gly 


Leu 


Gin 


Glu 


Lys 


Pro 


Pro 


Gly Val 


Leu 


Ala 


Leu 


Lys 






35 








40 










45 








Tyr 


Thr Asp 


Pro 


He 


Thr 


Thr 


Asn 


Ala 


L Y S 


His 


Glu 


Leu 


Thr 


Leu 


Lys 


50 










55 










60 






Thr 


Val 


Leu 


Gly 


Ser 


Asn 


He 


Thr 


Leu 


Glu 


Asn 


Gly 


Leu 


Leu 


Ser 


Ala 


65 








70 










75 








Gly 
95 


80 


Pro 


Thr 


Val 


Ser 


Pro 
85 


Pro 


Leu 


Thr 


Asn 


Ser 
90 


Asn 


Asn 


Ser 


Leu 


Leu 


Ala 


Thr 


Ser 


Ala 


Pro 


He 


Ala 


Val 


Ser 


Ala 


Asn 


Ser 


Leu 


Thr 


Leu 


Ala 






100 










105 










110 






Thr 


Ala 


Ala 


Pro 


Leu 


Thr 


Val 


Ser 


Asn 


Asn 


Gin 


Leu 


Ser 


He 


Asn 


Ala 






115 










120 










125 






Thr 


Gly Arg Gly 


Leu 


Val 


He 


Thr 


Asn 


Asn 


Ala 


Leu 


Thr 


Val 


Asn 


Pro 




130 










135 










140 








Ala 


Gly Ala 


Leu 


Gly 


Phe 


Asn 


Asn 


Thr 


Gly 


Ala 


Leu 


Gin 


Leu 


Asn 


Ala 


145 








150 










155 










160 


Gly Gly 


Met 


Arg 


Val 


Asp 


Gly Ala 


Asn 


Leu 


He 


Leu 


His 


Val 


Ala 


Tyr 










165 










170 










175 




Pro 


Phe 


Glu 


Ala 
180 


He 


Asn 


Gin 


Leu 


Thr 
185 


Leu 


Arg 


Leu 


Glu 


Asn 
190 


Gly 


Leu 


Glu 


Val 


Thr 


Ser 


Gly 


Gly 


Lys 


Leu 


Asn 


Val 


Lys 


Leu 


Gly 


Ser 


Gly 


Leu 






195 






200 










205 






Thr 


Gin 


Phe 


Asp 


Ser 


Asn 


Gly 


Arg 


He 


Ala 


He 


Ser 


Asn 


Ser 


Asn 


Arg 




210 








215 










220 






Thr 




Arg 


Ser 


Val 


Pro 


Ser 


Leu 


Thr 


Thr 


He 


Trp 


Ser 


He 


Ser 


Pro 


Pro 


225 










230 










235 










240 


Asn 


Cys 


Ser 


lie 


Tyr 


Glu 


Thr 


Gin Asp 


Ala 


Asn 


Leu 


Phe 


Leu 


Cys 


Leu 








245 










250 










255 




Thr 


Lys 


Asn 


Gly Ala 


His 


Val 


Leu Gly 


Thr 


He 


Thr 


He 


Lys 


Gly 


Leu 
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0 & n 


Lys 


Gly 


Ala. 


Leu 










Pro 


Phe 


Asp 


Asn 




290 






Thr 


Trp 


Arg 


Tyr 


305 








Phe 


Met 


Pro 


Asn 


Gly 


Asn 


Met 


Leu 






340 


Tyr 


Asn 


Glu 


lie 




355 




Glu 


Pro 


Gly 


Lys 




370 






Thr 


Glu 


Glu 




385 









Arg Glu Met His 
280 

Gin Gly Asn Leu 
295 

Gin Glu Thr Asn 
310 

Ser Thr Val Tyr 
325 

lie Gin He Ser 

Asn Ser Gly Tyr 
360 

Pro Phe His Pro 
375 



-52- 



265 

Asp Asn Ala Leu 

Leu Asn Cys Ala 
300 

Ala Val Ala Ser 
315 

Pro Arg Asn Lys 
330 

Pro Asn He Thr 
345 

Ala Phe Thr Phe 

Pro Thr Ala Val 
380 





270 






Ser 


Leu 


Lys 


Leu 


285 








Leu 


Glu 


Ser 


Ser 


Asn 


Ala 


Leu 


Thr 








320 


Thr 


Ala 


His 


Pro 






335 




Phe 


Ser 


Val 


Val 




350 






Lys 


Trp 


Ser 


Ala 


365 








Phe 


Cys 


Tyr 


He 



<210> 65 
<211> 1194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 
<400> 65 

atgaaaagaa ccagaattga 
actcccagca tcccctatgt 
cccccaggag ttttagcact 
cttactttaa aacttggaag 
cccactgttt ctcctcccct 
cccatagctg tatcagctaa 
aacaaccagc ttagtattaa 
gttaatccta ccggagcgct 
ggaggaatga gagtggacgg 
atcaaccagc taacactgcg 
aacgt t aagt t gggat cagg 
agcaaccgaa ctcgaagtgt 
aactgctcca tttatgaaac 
gctcacgtat taggtactat 
gataacgctc tatctttaaa 
ttggaatcat ccacctggcg 
tttatgccca acagtacagt 
atccaaatct cgcctaacat 
cgtggtgatt gtttttgtac 
ggaaaacctt ttcacccacc 

<210> 66 
<211> 397 
<212> PRT 
<213> Artificial Sequence 

<220> 

<223> 41SF RGD 
<400> 66 

Met Lys Arg Thr Arg He Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 

1 5 10 15 

Asp Thr Phe Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 



agacgacttc aaccccgtct acccctatga caccttctca 60 
agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 12 0 
caagtacact gaccccatta ctaccaatgc taagcatgag 180 
caacataact ttagaaaatg ggttactttc ggccacagtt 240 
tacaaacagt aacaactccc tgggtttagc cacatccgct 3 00 
ctctctcaca ttggccaccg ccgcaccact gacagtaagc 3 60 
cgcgggcaga ggtttagtta taactaacaa tgccttaaca 420 
aggtttcaat aacacaggag ctttacaatt aaatgctgca 4 80 
tgccaactta attcttcatg tagcatatcc ctttgaagca 540 
attagaaaac gggttagaag taaccagcgg aggaaagctt 6 00 
cctccaattt gacagtaacg gacgcattgc tattagtaat 660 
accatccctc actaccattt ggtctatctc gcctacgcct 720 
ccaagatgca aacctatttc tttgtctaac taaaaacgga 7 80 
aacaatcaaa ggtcttaaag gagcactgcg ggaaatgcac 84 0 
acttcccttt gacaatcagg gaaatttact taactgtgcc 900 
ttaccaggaa accaacgcag tggcctctaa tgccttaaca 960 
gtatccacga aacaaaaccg ctcacccggg caacatgctc 1020 
caccttcagt gtcgtctaca acgagataaa ctgtgattgt 1080 
tagtgggtat gcttttactt ttaaatggtc agccgaaccg 114 0 
taccgctgta ttttgctaca taactgaaga ataa 1194 
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20 



Ser 


Ser 


Asp Gly 


Leu 


Gin 


Glu 


Lys 






35 










40 


Tvr 


Thr 


Asp 


Pro 


He 


Thr 


Thr 


Asn 


50 










55 




Leu 


Gly 


Ser 


Asn 


He 


Thr 


Leu 


Glu 


65 










70 






Pro 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Thr 










85 








Ala 


Thr 


Ser 


Ala 


Pro 


He 


Ala 


Val 








100 










Th.2r 


Ala 


Ala 


Pro 


Leu 


Thr 


Val 


Ser 






115 










120 


Glv 


Arg 


Gly 


Leu 


Val 


He 


Thr 


Asn 




130 










135 




Gly 


Ala 


Leu Gly 


Phe 


Asn 


Asn 


Thr 


145 










150 






Gly 


Gly 


Met 


Arg 


Val 


Asp 


Gly Ala 










165 








Pro 


Phe 


Glu 


Ala 


He 


Asn 


Gin 


Leu 








180 










Glu 


Val 


Thr 


Ser 


Gly 


Gly Lys 


Leu 






195 










200 


Gin 


Phe 


Asp 


Ser 


Asn 


Gly Arg 


He 




210 










215 




Arg 


Ser 


Val 


Pro 


Ser 


Leu 


Thr 


Thr 


225 










230 






Asn 


Cys 


Ser 


He 


Tyr 


Glu 


Thr 


Gin 










245 








Thr 


Lys 


Asn 


Gly Ala 


His 


Val 


Leu 








260 










Lys 


Glv 


Ala 


Leu 


Arg 


Glu 


Met 


'His 






275 










280 


Pro 


Phe 


Asp 


Asn 


Gin 


Gly Asn 


Leu 




290 










295 




Thr 


Trp 


Arg 


Tyr 


Gin 


Glu 


Thr 


Asn 


305 










310 






Phe 


Met 


Pro 


Asn 


Ser 


Thr 


Val 


Tyr 










325 








Gly 


Asn 


Met 


Leu 


He 


Gin 


He 


Ser 








340 










Tyr 


Asn 


Glu 


He 


Asn 


Cys 


Asp 


Cys 






355 










360 


Gly 


Tyr 


Ala 


Phe 


Thr 


Phe 


Lys 


Trp 




370 










375 




His 


Pro 


Pro 


Thr 


Ala 


Val 


Phe 


Cys 


385 










390 







25 










3 0 






Pro 


Pro 


Gly Val 


Leu 


Ala 


Leu 


Lys 










45 








Ala 


Lys 


His 


Glu 


Leu 


Thr 


Leu 


Lys 








60 










Asn 


Gly 


Leu 


Leu 


Ser 


Ala 


Thr 


Val 






75 










80 


Asn 


Ser 


Asn 


Asn 


Ser 


Leu 


Glv 


Leu 




90 










95 




Ser 


Ala 


Asn 


Ser 


Leu 


Thr 


Leu 


Ala 


105 










110 






Asn 


Asn 


Gin 


Leu 


Ser 


He 


Asn 


Ala 










125 








Asn 


Ala 


Leu 


Thr 


Val 


Asn 


Pro 


Thr 








140 










Gly 


Ala 


Leu 


Gin 


Leu 


Asn 


Ala 


Ala 




155 










160 


Asn 


Leu 


He 


Leu 


His 


Val 


Ala 


Tyr 

J; 




170 










175 




Thr 


Leu Arg 


Leu 


Glu 


Asn 


Glv 


Leu 


185 










190 






Asn 


Val 


Lys 


Leu Gly 


Ser 


Gly 

^ J: 


Leu 










205 








Ala 


He 


Ser 


Asn 


Ser 


Asn 


Arg 


Thr 








220 










He 


Trp 


Ser 


He 


Ser 


Pro 


Thr 


Pro 






235 










240 


Asp 


Ala 


Asn 


Leu 


Phe 


Leu 


Cys 


Leu 




250 










255 




Glv 


Thr 


He 


Thr 


He 


Lys 


Glv 


Leu 


265 










270 






Asp 


Asn 


Ala 


Leu 


Ser 


Leu 


Lys 


Leu 










285 








Leu 


Asn 


Cys 


Ala 


Leu 


Glu 


Ser 


Ser 








300 










Ala 


Val 


Ala 


Ser 


Asn 


Ala 


Leu 


Thr 






315 










320 


Pro 


Arg Asn 


Lys 


Thr 


Ala 


His 


Pro 




330 










335 




Pro 


Asn 


He 


Thr 


Phe 


Ser 


Val 


Val 


345 










350 






Arg 


Gly Asp 


Cys 


Phe 


Cys 


Thr 


Ser 










365 








Ser 


Ala 


Glu 


Pro 


Gly Lys 


Pro 


Phe 








380 










Tyr 


He 


Thr 


Glu 


Glu 









395 



<210> 67 
<211> 1737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5 PD1 pent on 
<400> 67 

atgcggcgcg cggcgatgta tgaggaaggt cctcctccct cctacgagag tgtggtgagc 60 
gcggcgccag tggcggcggc gctgggttct cccttcgatg ctcccctgga cccgccgttt 12 0 
gtgcctccgc ggtacctgcg gcctaccggg gggagaaaca gcatccgtta ctctgagttg 18 0 
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gcacccctat 
tccctgaact 
tacagcccgg 
ggcgacctga 
aataagttta 
ctgaaatacg 
atagacctta 
gttctggaaa 
cccgtcactg 
attttgctgc 
atccgcaagc 
ggtaacattc 
gaacagggcg 
tccaacgcgg 
ccctacgacg 
gaggccgaag 
cagaagaaac 
ataagcaatg 
gaccctcaga 
tcggagcagg 
acgcgccaga 
agcttctaca 
cacgtgttca 
accaccgtca 
agcatcggag 
gtttacaagg 



tcgacaccac 
accagaacga 
gggaggcaag 
aaaccatcct 
aggcgcgggt 
agtgggtgga 
tgaacaacgc 
gcgacatcgg 
gtcttgtcat 
caggatgcgg 
ggcaaccctt 
ccgcactgtt 

ggggtggcgc 

cagccgcggc 
tgcccgacta 
cagcggccga 
cggtgatcaa 
acagcacctt 
ccggaatccg 
tctactggtc 
tcagcaactt 
acgaccaggc 
atcgctttcc 
gtgaaaacgt 
gagtccagcg 
ccctgggcat 



ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 
gttcacgctg 
gatcgtggag 
ggtaaagttt 
gcctggggta 
ggtggacttc 
cqaggagggc 
ggatgtggac 
aggcggcagc 
aatgcagccg 
cgcgggcacc 
agctgccgcc 
acccctgaca 
cacccagtac 
ctcatggacc 
gttgccagac 
tccggtggtg 
cgtctactcc 
cgagaaccag 
tcctgctctc 
agtgaccatt 
agtctcgccg 



ctggtggaca 
tttctgacca 
atcaatcttg 
atgccaaatg 
cgcttgccta 
cccgagggca 
cactacttga 
gacacccgca 
tatacaaacg 
acccacagcc 
tttaggatca 
gcctaccagg 
aacagcagtg 
gtggaggaca 
agcgccacac 
cccgctgcgc 
gaggacagca 
cgcagctggt 
ctgctttgca 
atgatgcaag 
ggcgccgagc 
caactcatcc 
attttggcgc 
acagatcacg 
actgacgcca 
cgcgtcctat 



acaagtcaac 
cggtcattca 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 
actactccga 
aagtgggcag 
acttcagact 
aagccttcca 
gcctgagcaa 
cctacgatga 
cgagcttgaa 
gcagcggcgc 
tgaacgatag 
gggctgagga 
aacccgaggt 
agaaacgcag 
accttgcata 
ctcctgacgt 
accccgtgac 
tgttgcccgt 
gccagtttac 
gcccgccagc 
ggacgctacc 
gacgccgcac 
cgagccgcac 



ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 
gaccatgacc 
acagaacggg 
ggggtttgac 
tccagacatc 
cttgttgggc 
tctggagggt 
agatgacacc 
ggaagagaac 
ccgcggctac 
gaagcgcgct 
cgagaagcct 
ttacaaccta 
caactacggc 
aacctgcggc 
cttccgctcc 
gcactccaag 
ctctctgacc 
ccccaccatc 
gctgcgcaac 
ctgcccctac 
tttttga 



<210> 68 
<211> 578 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 23> Ad5 PD1 pent on 
<400> 68 

Met Arg Arg Ala Ala Met Tyr Glu Glu 

1 ~ ~ 5 
Ser Val Val Ser Ala Ala Pro Val Ala 

20 25 
Asp Ala Pro Leu Asp Pro Pro Phe Val 

35 40 
Thr Gly Gly Arg Asn Ser lie Arg Tyr 

50 55 
Asp Thr Thr Arg Val Tyr Leu Val Asp 
65 70 
Ser Leu Asn Tyr Gin Asn Asp His Ser 
85 

Gin Asn Asn Asp Tyr Ser Pro Gly Glu 
100 " ~ 105 

Leu Asp Asp Arg Ser His Trp Gly Gly 

115 12 0 

Thr Asn Met Pro Asn Val Asn Glu Phe 

130 135 
Ala Arg Val Met Val Ser Arg Leu Pro 
145 15 0 

Leu Lys Tyr Glu Trp. Val Glu Phe Thr 
165 

Glu Thr Met Thr lie Asp Leu Met Asn 
180 185 
Leu Lys Val Gly Arg Gin Asn Gly Val 



Gly Pro 
10 

Ala Ala 

Pro Pro 

Ser Glu 

Asn Lys 

75 
Asn Phe 
90 

Ala Ser 

Asp Leu 

Met Phe 

Thr Lys 
155 
Leu Pro 
170 

Asn Ala 



Pro Pro 

Leu Gly 

Arg Tyr 

45 
Leu Ala 
60 

Ser Thr 

Leu Thr 

Thr Gin 

Lys Thr 
125 
Thr Asn 
140 

Asp Asn 



Glu Gly 
lie Val 
Leu Glu Ser Asp 



Ser 

Ser 

30 

Leu 

Pro 

Asp 

Thr 

Thr 
110 
lie 

Lys 

Gin 

Asn 

Glu 
190 
He 



Tyr Glu 
15 

Pro Phe 

Arg Pro 

Leu Phe 

Val Ala 
80 

Val He 
95 

He Asn 

Leu His 

Phe Lys 

Val Glu 
160 
Tyr Ser 
175 

His Tyr 
Gly Val 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1737 
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195 



Lys 


Phe 


Asp 


Thr 




210 






Leu 


Val 


Met 


Pro 


225 








lie 


Leu 


Leu 


Pro 


Asn 


Leu 


Leu 


Gly 








260 


He 


Thr 


Tyr 


Asp 






275 




Val 


Asp 


Ala 


Tyr 




290 






Gly 


Gly 


Ala 


Gly 


305 








Ser 


Asn 


Ala 


Ala 


Ser 


Arg 


Gly 


Tyr 








340 


Thr 


Arg 


Ala 


Glu 






355 




"TV "I 

Ala 


Ala 


Pro 


Ala 




370 






Val 


He 


Lys 


Pro 


385 








He 


Ser 


Asn 


Asp 


Tyr 


Asn 


Tyr 


Gly 








420 


Cys 


Thr 


Pro 


Asp 






435 




Pro 


Asp 


Met 


Met 




450 






Ser 


Asn 


Phe 


Pro 


465 








Ser 


Phe 


Tyr 


Asn 


Thr 


Ser 


Leu 


Thr 








500 


Ala 


Arg 


Pro 


Pro 






515 




Ala 


Leu 


Thr 


Asp 




530 






Val 


Gin 


Arg 


Val 


545 








Val 


Tyr 


Lys 


Ala 


Thr 


Phe 







200 

Arg Asn Phe Arg 
215 

Gly Val Tyr Thr 
230 

Gly Cys Gly Val 
245 

He Arg Lys Arg 

Asp Leu Glu Gly 
280 

Gin Ala Ser Leu 
295 

Gly Ser Asn Ser 
310 

Ala Ala Ala Met 
325 

Pro Tyr Asp Val 

Glu Lys Arg Ala 
360 

Ala Gin Pro Glu 
375 

Leu Thr Glu Asp 
390 

Ser Thr Phe Thr 
405 

Asp Pro Gin Thr 

Val Thr Cys Gly 
44 0 

Gin Asp Pro Val 
455 

Val Val Gly Ala 
470 

Asp Gin Ala Val 
485 

His Val Phe Asn 

Ala Pro Thr He 
520 

His Gly Thr Leu 
535 

Thr He Thr Asp 
550 

Leu Gly He Val 
565 



Leu Gly 


Phe 


Asp 








220 


Asn 


Glu 


Ala 


Phe 






235 




Asp 


Phe 


Thr 


His 




250 






Gin 


Pro 


Phe 


Gin 


265 








Gly Asn 


He 


Pro 


Lys 


Asp 


Asp 


Thr 








300 


Ser 


Q iy 


Ser 


Gly 






315 




Gin 


Pro 


Val 


Glu 




330 






Pro 


Asp 


Tyr 


Ala 


345 








Glu 


Ala 


Glu 


Ala 


Val 


Glu 


Lys 


Pro 








380 


Ser 


Lys 


Lys 


Arg 






395 




Gin 


Tyr 


Arg 


Ser 




410 






Gly 


He 


Arg 


Ser 


425 








Ser 


Glu 


Gin 


Val 


Thr 


Phe 


Arg 


Ser 








460 


Glu 


Leu 


Leu 


Pro 






475 




Tyr 


Ser 


Gin 


Leu 




490 






Arg 


Phe 


Pro 


Glu 


505 








Thr 


Thr 


Val 


Ser 


Pro 


Leu 


Arg 


Asn 








540 


Ala Arg 


Arg 


Arg 






555 




Ser 


Pro 


Arg 


Val 




570 







205 








Pro 


Val 


Thr 


Gly 


His 


Pro 


Asp 


He 








240 


Ser Arg 


Leu 


Ser 






255 




Glu 


Gly 


Phe 


Arg 




270 






Ala 


Leu 


Leu 


Asp 


285 








Glu Gin Gly 


Gly 


Ala 


Glu 


Glu 


Asn 








320 


Asp 


Met 


Asn 


Asp 






335 




Gly Thr 


Ser 


Ala 




350 






Ala 


Ala 


Glu 


Ala 


365 








Gin 


Lys 


L ys 


Pro 


Ser 


Tyr 


Asn 


Leu 








400 


Trp 


Tyr 


Leu 


Ala 






415 




Trp 


Thr 


Leu 


Leu 




430 






Tyr 


Trp 


Ser 


Leu 


445 








Thr 


Arg 


Gin 


He 


Val 


His 


Ser 


Lys 








480 


He 


Arg 


Gin 


Phe 






495 




Asn 


Gin 


He 


Leu 




510 






Glu 


Asn 


Val 


Pro 


525 








Ser 


He 


Gly 


Gly 


Thr 


Cys 


Pro 


Tyr 








560 


Leu 


Ser 


Ser 


Arg 






575 





<210> 69 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 
<400> 69 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
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accggtcctc 
gagagtcccc 
atgcttgcgc 
caaaatgtaa 
atatctgcac 
atggtcgcgg 
aaacttagca 
acatcaggcc 
ctaactactg 
ggaaaactag 
accgtagcaa 
ggagccttgg 
attgattctc 
caactaaatc 
attaactaca 
gttaacctaa 
ggagatgggc 
attggccatg 
cttagttttg 
ttgtggaccg 
gatggcaaac 
ctagttggtg 
ttaagattat 
ccacttaaaa 
atgccaagta 
catggaatat 
ataatgctaa 
aatctaaatg 
ttcttttctt 



caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
gaataaaccc 
ttactttagt 
tatcagacac 
attttgactc 
ataaatcttc 
ctacagctta 
gttactacat 
acagccgtat 
caagtgaatc 
acattacaga 



ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
tccacctaac 
attagtaaaa 
tgtgaaccaa 
ttctggaaat 
tacagcgacc 
tcccttcaac 
gactagttat 
gatttcttcc 
tccagaaagc 
agacgacgaa 



cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtcaaattg 
aatggagggc 
atgttcacac 
ctattaactg 
agtgaaactg 
accactacta 
gatagaagtc 
aatgttgcct 
aacatagcta 
taa 



tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
tggaaaacac 
ttgttaatgg 
aaaagacagc 
aggaatcaga 
tagccagcag 
gggatagtga 
tatttccctt 
atgccataca 
cgctgaccac 



tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
taatacaaat 
ctacgtgtct 
aaacatccaa 
cttaaaaatt 
caaagccttt 
aaactacatt 
gaacatttct 
atttgaatgg 
atcccccttt 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560- 

1620' 

1680 

1740 

1773 



<210> 70 
<211> 590 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 



<400> 70 



Met 


L Y S 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr Phe Asn Pro Val 


Tyr 


Pro 


1 






5 










10 


15 




Tyr 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val Pro Phe Leu Thr 


Pro 


Pro 




20 










25 


30 






Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser Pro Pro Gly Val 


Leu 


Ser 






35 










40 




45 






Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser Asn Gly Met Leu 


Ala 


Leu 




50 










55 






60 






Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu Ala Gly Asn Leu 


Thr 


Ser 


65 










70 








75 




80 


Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Lys Lys Thr Lys 


Ser 


Asn 










85 










90 


95 




He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 


Leu 


Thr Val Thr Ser Glu 


Ala 


Leu 








100 










105 


110 






Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val Ala Gly Asn Thr Leu Thr 






115 










120 




125 






Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His Asp Ser Lys Leu 


Ser 


He 




130 










135 






140 






Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu Gly Lys Leu Ala 


Leu 


Gin 


145 








150 








155 




160 


Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser Ser Thr Leu Thr 


He 


Thr 








165 










170 


175 
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Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 


Thr 








180 










185 


L Y S 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 


Gly 






195 










200 




Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 


Leu 




210 










215 






Gly 


Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


Thr 


225 










230 








Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin 


Gly 










245 










Gly 


Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 


Asn 








260 










265 


Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 


Gin 






275 










280 




Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 




290 










295 






Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Ser 


305 










310 








Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 


Leu 










325 








Ala 


He 


Asn 


Ala 


Gly 


Asp 


Gly 


Leu 


Glu 








340 










345 


Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He 






355 










360 




Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu 




370 










375 






Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly Asn 


385 










390 








Leu 


Trp 


Thr 


Gly 


He 


Asn 


Pro 


Pro 


Pro 










405 










Thr 


Asn 


Thr 


Asn 


Asp 


Gly 


Lys 


Leu 


Thr 








420 










425 


Gly Leu 


Val 


Asn 


Gly 


Tyr 


Val 


Ser 


Leu 






435 










440 




Asn 


Gin 


Met 


Phe 


Thr 


Gin 


Lys 


Thr 


Ala 




450 










455 






Phe 


Asp 


Ser 


Ser 


Gly 


Asn 


Leu 


Leu 


Thr 


465 










470 








Pro 


Leu 


Lys 


Asn 


Lys 


Ser 


Ser 


Thr 


Ala 








485 










Ser 


Lys 


Ala 


Phe 


Met 


Pro 


Ser 


Thr 


Thr 






500 










505 


Thr 


Arg 


Asp 


Ser 


Glu 


Asn 


Tyr 


He 


His 






515 










520 




Ser 


Tyr 


Asp 


Arg 


Ser 


Leu 


Phe 


Pro 


Leu 




530 










535 






Ser 


Arg 


Met 


He 


Ser 


Ser 


Asn 


Val 


Ala 


545 










550 








Asn 


Leu 


Asn 


Ala 


Ser 


Glu 


Ser 


Pro 


Glu 










565 










Thr 


Ser 


Pro 


Phe 


Phe 


Phe 


Ser 


Tyr 


He 








580 








585 
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Gly 


Ser 


Leu 


Gly 


He 


Asp 


Leu 










190 






Lys 


Leu Gly Leu 


Lys 


Tyr 


Gly 








205 








Asn 


JL lAJu 


Leu 


T'Vi v- 
JL Il-L 


Val 


Ala 


Thr 






220 










Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 




235 










240 


Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 


250 








i 


255 




Arg 


Arg 


Leu 


He 


Leu 


Asp 


Val 










270 






Leu 


Asn 


Leu 


Arg 


Leu 


Gly 


Gin 








285 








Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 






300 










Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 




315 










320 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 


330 










335 




Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 










350 






Gly 


His 


Gly Leu Glu 


Phe 


Asp 








365 








Gly 

2 


Thr Gly Leu 


Ser 


Phe 


Asp 






380 










Lys 


Asn Asn Asp 


Lys 


Leu 


Thr 




395 










400 


Asn 


Cys 


Gin 


He 


Val 


Glu 


Asn 


410 










415 




Leu 


Val 


Leu 


Val 


Lys 


Asn 


Gly 










430 






Val 


Gly Val 


Ser 


Asp 


Thr 


Val 








445 








Asn 


He 


Gin 


Leu Arg 


Leu 


Tyr 






460 










Glu 


Glu 


Ser Asp 


Leu 


Lys 


He 




475 










480 


Thr 


Ser 


Glu 


Thr 


Val 


Ala 


Ser 


490 










495 




Ala 


Tyr 


Pro 


Phe 


Asn 


Thr 


Thr 








510 






Gly 


He 


Cys 


Tyr 


Tyr 


Met 


Thr 








525 








Asn 


He 


Ser 


He 


Met 


Leu 


Asn 






540 










Tyr 


Ala 


He 


Gin 


Phe 


Glu 


Trp 




555 










560 


Ser 


Asn 


He 


Ala 


Thr 


Leu 


Thr 


570 










575 




Thr 


Glu 


Asp 


Asp 


Glu 







590 
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<400> 71 

atgaccaaga 

agcacctccc 

agcccagacg 

ctacagctaa 

aacatacgtg 

aatggattag 

aacaacggtg 

tctcctaact 

tgtggcagtc 

atatctggaa 

ctaaacaatt 

ggcacagcct 

cacggtaaaa 

aaacctgtaa 

agtgcatact 

tttgccacat 



gagtccggct 
aacacccctt 
gagttcttac 
aagtgggagg 
ctacagcacc 
aaactcaaaa 
acatttgtat 
gtagactaaa 
aaatacttgc 
cagttcaaag 
ccttcctgga 
atacaaacgc 
ctgccaaaag 
cactaaccat 
ctatgtcatt 
cctcttacac 



cagtgactcc 
tataaaccca 
tttaaaatgt 
gggacttaca 
cattactaaa 
caataaacta 
aaaggatagt 
tgcagagaaa 
tacagtttca 
tgctcatctt 
cccagaatat 
tgttggattt 
taacattgtc 
tacactaaac 
ttcatgggac 
tttttcatac 



ttcaaccctg 
gggtttattt 
ttaaccccac 
gtggatgaca 
aataatcact 
tgtgccaaat 
attaacacct 
gatgctaaac 
gttttggctg 
attataagat 
tggaacttta 
atgcctaacc 
agtcaagttt 
ggtacacagg 
tggtctggcc 
attgcccaag 



tctaccccta 
ccccaaatgg 
taacaaccac 
ctgatggtac 
ctgtagaact 
tgggaaatgg 
tatggactac 
tcactttggt 
ttaaaggcag 
ttgacgaaaa 
gaaatggaga 
tatcagctta 
acttaaacgg 
aaacaggaga 
acaactacat 
aataa 



tgaagatgaa 60 
cttcacacaa 120 
aggcggatct 180 
cttacaagaa 24 0 
atccattgga 300 
gttaaaattt 3 60 
accagctcca 42 0 
cttaacaaaa 480 
tttggctcca 540 
tggagtgcta 60 0 
tcttactgaa 660 
tccaaaatct 720 
agacaaaact 780 
cacaactcca 840 
taatgaaata 90 0 
945 



<210> 72 
<211> 314 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 35TS5H 



<400> 72 



Met 


Thr 


Lys 


Arg 


Val 


Arg 


Leu 


Ser 


Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 








5 










10 










15 




Tyr 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 








20 










25 










30 






lie 


Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 


Ser 


Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 






35 










40 










45 








Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr 


Gly 


Gly 


Ser 


Leu 


Gin 


Leu 


Lys 




50 










55 










60 










Val 


Gly 


Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp 


Thr 


Asp 


Gly 


Thr 


Leu 


Gin 


Glu 


65 










70 










75 










80 


Asn 


lie 


Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 










85 










90 










95 




Leu 


Ser 


lie 


Gly 


Asn 


Gly 


Leu 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 








100 










105 










110 






Lys 


Leu 


Gly 


Asn 


Gly 


Leu 


Lys 


Phe 


Asn 


Asn 


Gly 


Asp 


He 


Cys 


He 


Lys 






115 










120 










125 








Asp 


Ser 


He 


Asn 


Thr 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Pro 


Asn 


Cys 




130 










135 










140 










Arg 


Leu 


Asn 


Ala 


Glu 


Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


145 










150 










155 










160 


Cys 


Gly 


Ser 


Gin 


He 


Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly 










165 










170 










175 




Ser 


Leu 


Ala 


Pro 


He 


Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 








180 








185 










190 






Arg 


Phe 


Asp 


Glu 


Asn 


Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 






195 










200 










205 








Glu 


Tyr 


Trp 


Asn 


Phe 


Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly Thr Ala 


Tyr 




210 










215 










220 










Thr 


Asn 


Ala 


Val 


Gly 


Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


225 










230 










235 










240 


His 


Gly 


Lys 


Thr 


Ala 


Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 






245 








250 








255 




Gly Asp 


Lys 


Thr 


Lys 


Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn 


Gly Thr 








260 










265 










270 
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Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser 

275 280 285 

Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu He Phe Ala Thr Ser 

290 295 300 

Ser Tyr Thr Phe Ser Tyr lie Ala Gin Glu 
305 310 
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